Receiving apparatus, receiving method, transmitting apparatus, and transmitting method

ABSTRACT

A receiving apparatus includes circuitry configured to receive a digital broadcast signal which uses an internet protocol (IP) transfer method. Based on control information included in the digital broadcast signal, the circuitry acquires at least one of a broadcast component or a communication component, and controls an operation of each section which performs a predetermined process relating to the acquired at least one component. The broadcast component and the communication component are associated with a component hierarchy including a first hierarchy level, a second hierarchy level, and a third hierarchy level for each component category.

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. Ser. No. 16/365,920, filedMar. 27, 2019, which is a continuation of U.S. Ser. No. 15/129,034,filed Sep. 26, 2016, which is a National Stage of PCT/JP2015/001907,filed Apr. 3, 2015 and claims the benefit of Japan Application No.2014-078032, filed Apr. 4, 2014. The entire contents of U.S. Ser. No.16/365,920 and U.S. Ser. No. 15/129,034 are incorporated herein byreference.

TECHNICAL FIELD

The present technique relates to a receiving apparatus, a receivingmethod, a transmitting apparatus, and a transmitting method and, inparticular, relates to a receiving apparatus, a receiving method, atransmitting apparatus, and a transmitting method in which it ispossible to select an optimum component from components which aretransferred in a broadcast or by communication.

BACKGROUND ART

In recent years, in the field of digital broadcasting, in addition toservices which use broadcasting, hybrid type services which arecoordinated with communication are being introduced (for example, referto PTL 1). In such a hybrid type service, it is possible to transfercomponents such as video, audio, or subtitles for providing the serviceusing either broadcasting or communication.

CITATION LIST Patent Literature [PTL 1] Japanese Unexamined PatentApplication Publication No. 2011-66556 SUMMARY OF INVENTION TechnicalProblem

While it is possible to supply various types of services due to theintroduction of hybrid type services, it is necessary to select anoptimum component from components which are transferred in a broadcastor by communication. However, a technical method for selecting anoptimum component from components which are transferred in a broadcastor by communication has not been established.

It is desirable to be able to select an optimum component fromcomponents which are transferred in a broadcast or by communication.

Solution to Problem

A receiving apparatus of a first embodiment of the present technique isa receiving apparatus including circuitry configured to receive adigital broadcast signal which uses an internet protocol (IP) transfermethod. Based on control information included in the digital broadcastsignal, the circuitry acquires at least one of a broadcast component ora communication component, and controls an operation of each sectionwhich performs a predetermined process relating to the acquired at leastone component. The broadcast component and the communication componentare associated with a component hierarchy including a first hierarchylevel, a second hierarchy level, and a third hierarchy level for eachcomponent category. The third hierarchy level is for adaptivelyselecting the broadcast component or the communication component. Thesecond hierarchy level is for synthesizing the component which isadaptively selected in the third hierarchy level and a component whichis not a target in the third hierarchy into one synthesized component.The first hierarchy level is for selecting one of the synthesizedcomponent which is synthesized in the second hierarchy level, thecomponent which is adaptively selected in the third hierarchy level, anda component which is not a target in the second hierarchy level and thethird hierarchy level.

The receiving apparatus may be an independent apparatus, or may be aninner section block which configures one apparatus.

A receiving method of the first embodiment of the present technique is areceiving method which corresponds to the receiving apparatus of thefirst embodiment of the present technique.

In the receiving apparatus and the receiving method of the firstembodiment of the present technique, a digital broadcast signal whichuses an IP transfer method is received. Based on control informationincluded in the digital broadcast signal, at least one of a broadcastcomponent and a communication component is acquired and an operation ofeach section of the receiving apparatus which performs a predeterminedprocess relating to the acquired at least one component is controlled.The broadcast component and the communication component are associatedwith a component hierarchy including a first hierarchy level, a secondhierarchy level, and a third hierarchy level for each componentcategory. The third hierarchy level is for adaptively selecting thebroadcast component or the communication component. The second hierarchylevel is for synthesizing the component which is adaptively selected inthe third hierarchy level and a component which is not a target in thethird hierarchy into one synthesized component. The first hierarchylevel is for selecting one of the synthesized component which issynthesized in the second hierarchy level, the component which isadaptively selected in the third hierarchy level, and a component whichis not a target in the second hierarchy level and the third hierarchylevel.

A transmitting apparatus of a second embodiment of the present techniqueis a transmitting apparatus including circuitry configured to acquirecontrol information, acquire a broadcast component of a service, andtransmit the control information in addition to the broadcast componentin a digital broadcast signal which uses an IP transfer method. Thebroadcast component and a communication component are associated with acomponent hierarchy structure including a first hierarchy level, asecond hierarchy level, and a third hierarchy level for each componentcategory. The third hierarchy level is for adaptively selecting thebroadcast component or the communication component. The second hierarchylevel is for synthesizing the component which is adaptively selected inthe third hierarchy level and a component which is not a target in thethird hierarchy level into one component. The first hierarchy level isfor selecting one of the component which is synthesized in the secondhierarchy level, the component which is adaptively selected in the thirdhierarchy level, and a component which is not a target in the secondhierarchy level and the third hierarchy level.

The transmitting apparatus may be an independent apparatus, or may be aninner section block which configures one apparatus.

A transmitting method of a second embodiment of the present technique isa transmitting method which corresponds to the transmitting apparatus ofthe second embodiment of the present technique.

In the transmitting apparatus and the transmitting method of the secondembodiment of the present technique, control information is acquired, abroadcast component of a service is acquired, and the controlinformation is transmitted in addition to the broadcast component indigital broadcast signal which uses an IP transfer method. The broadcastcomponent and a communication component are associated with a componenthierarchy structure including a first hierarchy level, a secondhierarchy level, and a third hierarchy level for each componentcategory. The third hierarchy level is for adaptively selecting thebroadcast component or the communication component. The second hierarchylevel is for synthesizing the component which is adaptively selected inthe third hierarchy level and a component which is not a target in thethird hierarchy level into one component. The first hierarchy level isfor selecting one of the component which is synthesized in the secondhierarchy level, the component which is adaptively selected in the thirdhierarchy level, and a component which is not a target in the secondhierarchy level and the third hierarchy level.

Advantageous Effects of Invention

It is desirable to select an optimum component from components which aretransferred in a broadcast or by communication according to the firstembodiment and the second embodiment of the present technique.

Here, the effects described here are not necessarily limited and may beany of the effects described in the present disclosure.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram which shows a protocol stack of digital broadcastingwith an IP transfer method.

FIG. 2 is a diagram which shows a relationship between a signal of abroadcast wave of digital broadcasting which uses an IP transfer methodand an ID system of an IP transfer method.

FIG. 3 is a diagram which shows a configuration of a broadcast wave ofdigital broadcasting with an IP transfer method.

FIG. 4 is a diagram which shows a component layer structure.

FIG. 5 is a diagram of a component layer structure.

FIG. 6 is a diagram which shows a configuration example of signalinginformation.

FIG. 7 is a diagram which shows a specific example of a component layer.

FIG. 8 is a diagram which shows an example of a screen layout.

FIG. 9 is a diagram which shows an example of a screen transition.

FIG. 10 is a diagram which shows a configuration of signalinginformation.

FIG. 11 is a diagram which shows component layer correspondence usingMPD.

FIG. 12 is a diagram which shows a specific operative example ofcomponent layer correspondence using MPD.

FIG. 13 is a diagram which shows a method for coordinating SPD and MPD.

FIG. 14 is a diagram which shows a specific operative example ofcoordination of SPD and MPD.

FIG. 15 is a diagram which shows syntax of an SPD.

FIG. 16 is a diagram which shows syntax of a component layer descriptor.

FIG. 17 is a diagram which shows syntax of an SPD.

FIG. 18 is a diagram which shows a configuration of one embodiment of abroadcast communication system to which the present technique isapplied.

FIG. 19 is a diagram which shows a configuration of one embodiment of atransmitting apparatus to which the present technique is applied.

FIG. 20 is a diagram which shows a configuration of one embodiment of adistribution server to which the present technique is applied.

FIG. 21 is a diagram which shows a configuration of one embodiment of areceiving apparatus to which the present technique is applied.

FIG. 22 is a flowchart which illustrates a transmission process.

FIG. 23 is a flowchart which illustrates a distribution process.

FIG. 24 is a flowchart which illustrates a receiving process.

FIG. 25 is a diagram which shows a configuration example of a computer.

DESCRIPTION OF EMBODIMENTS

Below, description will be given of embodiments of the present techniquewith reference to the diagrams. Here, description will be performed inthe following order.

1. Summary of Digital Broadcasting using IP Transfer Method

2. Component Layer Structure

-   -   (1) Details of Component Layer Structure    -   (2) Specific Examples of Component Layer Structure

3. Configuration of Signaling Information

-   -   (1) Details of Signaling Information    -   (2) Component Layer Correspondence using MPD    -   (3) Coordination of SPD and MPD    -   (4) Syntax

4. System Configuration

5. Flow of Specific Processes Executed in each Apparatus

6. Configuration of Computer

<1. Summary of Digital Broadcasting Using IP Transfer Method> (ProtocolStack)

FIG. 1 is a diagram which shows a protocol stack of digital broadcastingwith an IP transfer method.

As shown in FIG. 1, the lowest hierarchy is set as a physical layer anda frequency band of a broadcast wave which is assigned for a service(channel) corresponds thereto. A higher hierarchy adjacent to thephysical layer is set as an IP layer by interposing a base band packetstream (BBP stream). The BBP stream is a stream which includes a packetin which various types of data in an IP transfer method are stored.

The IP layer is equivalent to an internet protocol (IP) in a TCP/IPprotocol stack and an IP packet is specified by an IP address. A higherhierarchy adjacent to the IP layer is set as a UDP layer and even higherhierarchies than the UDP layer are set as an RTP and a FLUTE/ALS. Thatis, in a digital broadcast with an IP transfer method, a packet where aport number of a user datagram protocol (UDP) is designated istransmitted and for example, a real-time transport protocol (RTP)session or a file delivery over unidirectional transport (FLUTE) sessionis established.

A higher hierarchy adjacent to the FLUTE/ALS is set as a fragmented MP4(fMP4) and higher hierarchies adjacent to the RTP and fMP4 are set asvideo data (Video), audio data (Audio), subtitles data (Closed Caption),and the like. That is, an RTP session is used in a case of transferringvideo data or audio data in a stream format and a FLUTE session is usedin a case of transferring video data or audio data in a file format.

In addition, the higher hierarchy of the FLUTE/ALS is set as NRTcontent, ESG, and SCS, and the NRT content, ESG, and SCS are transferredby a FLUTE session. The NRT content is content which is transferred in anon-real time (NRT) broadcast and playback is performed after the NRTcontent is temporarily accumulated in a storage of a receiving device.Here, the NRT content is one example of content and files of othercontent may be transferred by a FLUTE session. ESG is an electronicservice guide.

Service channel signaling (SCS) is signaling information in a serviceunit and is transferred by a FLUTE session. For example, a serviceparameter description (SPD), a user service description (USD), a mediapresentation description (MPD), and a session description protocol (SDP)are transferred as SCS.

Low layer signaling (LLS) is signaling information of a low layer and istransferred on a BBP stream. For example, service configurationinformation such as a service configuration description (SCD), anemergency alerting description (EAD), and a region rating description(RRD) is transferred as LLS.

(ID System in IP Transfer Method)

FIG. 2 is a diagram which shows a relationship between a signal of abroadcast wave of digital broadcasting which uses an IP transfer methodand an ID system of an IP transfer method.

As shown in FIG. 2, a network ID (network_id) is assigned to a broadcastwave (broadcast network) which has a predetermined frequency band (6MHz). One or a plurality of BBP streams which are distinguished by a BBPstream ID (BBP_stream_id) are included in each broadcast wave. A BBPstream is configured by a plurality of BBP packets formed of a BBPheader and a payload.

One or a plurality of services which are distinguished by a service ID(service_id) are included in each BBP stream. Each service is configuredby one or a plurality of components. Each component is, for example,information which configures a program such as video data, audio data,and subtitles.

In this manner, a configuration which corresponds to a combination(referred to below as a “triplet”) of a network ID (network_id), atransport stream ID (transport_stream_id), and a service ID (service_id)which are used in an MPEG2-TS method is adopted as an ID system of an IPtransfer method and a BBP stream configuration and a serviceconfiguration in a broadcast network are shown according to the triplet.However, in an ID system of an IP transfer method, a BBP stream ID isused instead of a transport stream ID.

Since it is possible to obtain a match with an MPEG2-TS method which iscurrently widespread by using such an ID system, for example, it ispossible to easily obtain correspondence with a simulcast when shiftingfrom an MPEG2-TS method to an IP transfer method.

(Configuration of Broadcast Wave by IP Transfer Method)

FIG. 3 is a diagram which shows a configuration of a broadcast wave ofdigital broadcasting with an IP transfer method.

As shown in FIG. 3, a plurality of BBP streams are transferred in abroadcast wave (‘Network’ in the diagram) which has a predeterminedfrequency band. In addition, a network time protocol (NTP), a pluralityof service channels, an electronic service guide (ESG Service), and LLSare included in each BBP stream. However, the NTP, the service channel,and the electronic service guide are transferred according to a UDP/IPprotocol, but the LLS is transferred on a BBP stream. In addition, theNTP is timing information and is common to a plurality of servicechannels.

A component such as video data or audio data and SCS such as SPD or SDPare included in each service channel. In addition, a common IP addressis given to each service channel and it is possible to package acomponent, a control signal (SCS), or the like for one or each of aplurality of service channels using the IP address. Here, in FIG. 3, thenetwork, the BBP stream, and the component correspond to FIG. 2;however, the service channel corresponds to the service in FIG. 2.

<2. Component Layer Structure> (1) Details of Component Layer Structure

FIG. 4 is a diagram which shows a component layer structure.

As shown in FIG. 4, each component of video, audio, and subtitles(captions) is configured by three hierarchies of a selective layer, acomposite layer, and an adaptive layer. In the hierarchy structure, acomposite layer is arranged as a higher hierarchy of an adaptive layerand a selective layer is arranged as a higher hierarchy of the compositelayer. Below, description will be given of the details of each of thelayers in sequence.

(Adaptive Layer)

Firstly, description will be given of the adaptive layer. As shown inFIG. 4, in an adaptive layer which is the lowest hierarchy, one symbolwith a circular shape with a different pattern represents a component(broadcast Component) which is transmitted by a broadcast wave and theother symbol represents a component (broadband Component) which isdistributed via a network. These components are distributed by so-calledadaptive streaming and a plurality of components with different bitrates are prepared. Here, in the example in FIG. 4, one broadcastcomponent is prepared for each category; however, a plurality may beprepared. In addition, here, the classifications of components such asvideo, audio, or subtitles are referred to as categories.

In the adaptive layer, one component is selected from a plurality ofcomponents by a straight line which swings left to right on a dotted arcin the diagram functioning as a switch. That is, an adaptive layer is ahierarchy for dynamically switching a plurality of components tofunction as one component based on an adaptive determination of areceiving device in each component category. However, in a case in whichthere is only one component, a selection using the switch is notperformed and this component only is selected every time.

In addition, it is possible to designate a transfer path or a bit rateof a component as an attribute of an adaptive selection target in theadaptive layer. For example, broadcast or communication is designated asan attribute value of a transfer path. In addition, for example, 10 Mbpsor the like is designated as a bit rate. Furthermore, for example, anattribute relating to the screen resolution or the robustness of thephysical layer (PHY) may be designated. Here, the attribute of anadaptive selection target described above is one example and it may bepossible to designate another attribute.

Since it is possible to designate such an attribute, an optimumcomponent is adaptively selected and switched every set period (forexample, 10 seconds) in a receiving device and adaptive streamingdistribution is realized. In detail, in a case in which the transferpath of a component is communication only, it is possible to select acomponent with an optimum bit rate according to a filled up status of areceiving buffer of a receiving device which changes according to thecongestion status of a communication path (for example, a receivingbuffer 421 in FIG. 21 which will be described below).

In addition, in a case in which a broadcast is also included in additionto communication as the transfer path of a component, it is possible todetermine the selection with the communication component according tothe bit rate of a broadcast component. Here, for example, an operationis assumed to be in a case in which each component of 1 Mbps, 2 Mbps, 5Mbps, 10 Mbps, and 20 Mbps is prepared as a component which isdistributed via a network and only a component of 8 Mbps is prepared asa component which is transmitted by a broadcast wave.

In this case, when it is possible for the receiving device to receive acommunication component of 10 Mbps or 20 Mbps, since the bit rate ishigher for the communication component than the broadcast component, thecommunication component is received as a priority among the components.In addition, when it is possible to receive a communication component of5 Mbps without being able to receive communication components of 10 Mbpsand 20 Mbps due to congestion, it is possible to operate such that abroadcast component of 8 Mbps which is able to be stably received isselected instead of the communication component. Here, in a case inwhich a plurality of broadcast components are prepared, for example,changes in a broadcast signal of carrier/noise (C/N) are measured andthe broadcast component may be adaptively switched according to themeasurement result.

(Composite Layer)

Next, description will be given of the composite layer. As shown in FIG.4, in a composite layer which is a hierarchy above the adaptive layer, aplurality of components which are adaptively selected by the adaptivelayer are synthesized into one component. That is, the composite layeris a hierarchy for combining a plurality of components in a synthesistarget component group (referred to below as a “composite componentgroup”) to function as one component (a synthesized component) in eachcomponent category. However, in a case in which the synthesis targetcomponent group is only one component, a synthesizing operation is notnecessary.

In addition, it is possible to designate a scalable or athree-dimensional video (3D), a tile, a layer, mixing, or the like as anattribute of the synthesis target in the composite layer. Theseattributes show types of combinations and it is possible to designate anattribute value which shows an element in such a combination.

“Scalable” is an attribute which shows that a synthesis target componentis a component where scalable encoding is carried out. “Base” or“Extended” is designated as an attribute value of the scalableattribute.

For example, in a case of supplying a video with 4K resolution, anencoded signal (a video component) which is equivalent to a video with2K resolution where “Base” is designated as an attribute value of ascalable attribute is transmitted by a broadcast wave and is distributedvia a network by designating “Extended” as an attribute value of ascalable attribute for a video encoded signal (a video component) whichis equivalent to the difference between 4K resolution and 2K resolution.Due to this, it is possible for a receiving device which handles 4Kresolution to display a video with 4K resolution by synthesizing a basestream which is transferred in a broadcast and an extended stream whichis transferred by communication. On the other hand, a receiving devicewhich does not handle 4K resolution only uses the base stream which istransferred in a broadcast and displays a video with 2K resolution. “3D”is an attribute which shows that the synthesis target component is acomponent used for 3D. “Right” or “Left” is designated as an attributevalue of the 3D attribute. For example, a video signal (a videocomponent) for a right eye where “Right” is designated as an attributevalue of the 3D attribute is transmitted in a broadcast wave and a videosignal (a video component) for a left eye where “Left” is designated asan attribute value of the 3D attribute is distributed via a network. Dueto this, it is possible for a receiving device which handles 3D todisplay a 3D video by synthesizing the video signal for a right eye andthe video signal for a left eye.

“Tile” is an attribute which shows that the synthesis target componentis a component for tiling. For example, “Tile A1”, “Tile A2” . . . ;“Tile B1”, “Tile B2” . . . ; or “Tile C1”, “Tile C2” . . . aredesignated as an attribute value of the tile attribute.

For example, in “Tile A1”, “Tile A2” . . . , “Tile A” shows that thetype of tiling is type A. In a case of a method in which the type Atiling displays a video for two tiles by lining the tiles uphorizontally, a video signal (a video component) for the tile which isarranged on the left side where “Tile A1” is designated as an attributevalue of a tile attribute is transmitted by a broadcast wave and a videosignal (a video component) for the tile which is arranged on the rightside where “Tile A2” is designated as an attribute value of a tileattribute is distributed via a network. Due to this, it is possible fora receiving device which handles the tiling display to display a videowhich corresponds to a type A tiling by synthesizing a video signal forthe left side tile and a video signal for the right side tile.

In the same manner, for example, in a case of a method in which type Btiling displays a video for four tiles by lining up and arranging thetiles to be 2×2, since a video signal (a video component) for the fourtiles is transferred in a broadcast or by communication, it is possibleto display a 2×2 video which corresponds to the type B tiling bysynthesizing the video for the tiles in the receiving device. Inaddition, for example, in a case of a method in which type C tilingdisplays a video for a plurality of tiles by arranging the tiles as apanorama video (for example, 360 degrees), since a video signal (a videocomponent) for a plurality of tiles is transferred in a broadcast or bycommunication, it is possible to display a panorama video whichcorresponds to the type C tiling by synthesizing a video signal for thetiles in a receiving device. Here, the tiling methods for types A to Cdescribed above are examples and it is possible to adopt another tilingmethod.

“Layer” is an attribute which shows that the synthesis target componentis a component layer which is displayed in a hierarchy form. “Layer 1”,“layer 2”, . . . are designated, for example, in sequence from the backof the overlay as an attribute value of the layer attribute. Forexample, a video signal (a video component) of a first layer where“layer 1” is designated as an attribute value of the layer attribute istransmitted by a broadcast wave and a video signal (a video component)of a second layer where “layer 2” is designated as an attribute value ofthe layer attribute is distributed via a network. Due to this, in areceiving device which handles a layer display, it is possible todisplay a video where the video of the second layer is overlaid on thevideo of the first layer by synthesizing a video signal of the firstlayer and a video signal of the second layer.

“Mixing” is an attribute which shows that a synthesis target componentis a component to be mixed. For example, “Track 1”, “Track 2” . . . aredesignated as the mixing attribute. For example, a soundtrack (an audiocomponent) where “Track 1” is designated as the attribute value of themixing attribute is transmitted by a broadcast wave and a soundtrack (anaudio component) where “Track 2” is designated as the attribute value ofthe mixing attribute is distributed via a network. Due to this, it ispossible to output sound which is obtained by mixing a soundtrack 1 anda soundtrack 2 (for example, by adjusting a relative volume position ora panning position) in a receiving device which handles mixing.

Here, the attributes of the synthesis target or the attribute valuesthereof in the composite layer described above are examples and it maybe possible to designate another attribute or another attribute value.

(Selective Layer)

Finally, description will be given of the selective layer. As shown inFIG. 4, in the selective layer which is a higher hierarchy of thecomposite layer and the highest hierarchy, one component is selectedfrom a plurality of components by a straight line which swings left toright on a dotted arc in the diagram functioning as a switch. That is,the selective layer is a hierarchy for statically selecting one or aplurality of components from a component group of a fixed selectiontarget (referred to below as a “selective component group”) in eachcomponent category according to a predetermined selecting method. As theselecting method, other than selection by a user, automatic selection bya receiving device is possible, for example, according to theperformance of a receiving device, the preference information of theuser, or the like.

In addition, it is possible to designate a view tag, a language, areceiving device process requesting ability, a view title, a purpose, orthe like as an attribute of the fixed selection target in the selectivelayer.

In detail, “view tag” is a tag for combining different componentcategories which configure one view. For example, in a case in which “1”is designated as the view tag, components of the video, the audio, andthe subtitles to which the ID “1” is given in the diagram are selectedby moving to a new category. In the same manner, in a case in which “2”is designated as the view tag, components to which ID “2” is given inthe diagram are selected by moving to a new category. Here, componentsto which a view tag is not given are independent in each category.

For example, a language code is designated for the “language”. Forexample, a desired language is selected by the user by presenting agraphical user interface (GUI) according to the language code. A processrequesting ability which is necessary in the receiving device isdesignated as a “receiving device process requesting ability”. Therequesting ability may be designated by a level value, or may bemulti-dimensionally designated by a codec, a resolution, or the like.For example, when level 2 is designated as the level value in a case ofdesignating a process requesting ability by level value, only areceiving device which has a processing ability of level 2 or more isable to handle the request.

A title for selecting a view screen is designated as a “view title”. Forexample, by displaying the view title as text, a desired view screen isselected by the user. For example, information relating to the purposeof a component, such as sound for narration as opposed to sound for amain part, is designated as the “purpose”.

Here, the attributes of the fixed selection target in the selectivelayer described above are examples and it may be possible to designateanother attribute. In addition, the attribute of the fixed selectiontarget is not limited to one and a plurality of attributes may becombined and used.

Since it is possible to designate such fixed selection targetattributes, for example, an application which is being executed in thereceiving device is able to select a component based on the attribute ofthe fixed selection target. However, in a case in which there is onlyone selective component group, selection is not necessary and theselective component group is selected. In addition, components areselected in group units in a case of being grouped in a combination ofcomponents of different categories such as video, audio, and subtitlesby view tag.

In addition, in a case in which a plurality of components are selectedin the receiving device, when the selection target is the video andsubtitles components, a plurality of screens for the video and subtitlesare displayed. In addition, in a case in which a plurality of componentsare selected, when the selection target is only the audio component, aplurality of sounds are mixed (mixing) and then output.

Here, a case in which one selective layer is present in each componentcategory is shown in the example of a component layer structure in FIG.4; however, a plurality of selective layers may be present in eachcomponent category. In addition, in the component layer structure inFIG. 4, description is given with the components of the video, theaudio, and the subtitles as the components; however, it is possible toadopt the same layer structure for other components which configure aservice.

(Diagram of Component Layer Structure)

FIG. 5 is a diagram of a component layer structure.

In FIG. 5, a pick one component which is a level 3 layer is equivalentto the adaptive layer in FIG. 4 and one component is selected from n (nis an integer of 0 or greater) elementary components. Here, theelementary components in FIG. 5 represent the components themselves suchas the video, the audio, the subtitles, and the like.

A composite component which is a level 2 layer is equivalent to thecomposite layer in FIG. 4 and n (n is an integer of 0 or greater)components selected in the level 3 pick one component and n (n is aninteger of 0 or greater) elementary components are synthesized.

A pick one component which is a level 1 layer is equivalent to theselective layer in FIG. 4 and one component is selected from n (n is aninteger of 0 or greater) components which are synthesized in the level 2composite component, n (n is an integer of 0 or greater) componentswhich are selected in the level 3 pick one component, and n (n is aninteger of 0 or greater) elementary components.

In this manner, it is possible to represent the component layerstructure in FIG. 4 with a diagram such as shown in FIG. 5.

(Configuration Example of Signaling Information)

FIG. 6 is a diagram which shows a configuration example of signalinginformation (a control signal) for realizing the component layerstructure shown in FIG. 4 and FIG. 5. Here, in the elements andattributes in FIG. 6, “@” denotes the attributes. In addition, theelements and attributes which are indented are designated with respectto the upper elements thereof.

As shown in FIG. 6, a topAttribute attribute is an attribute for finalselection and is defined by m0 types (m0 is an integer of 1 or greater).For example, as the topAttribute attribute, it is possible to define aselection number and a selection determination. The selection numberdesignates the number of components which are able to be selected as awhole. In addition, the selection determination designates selection bythe user, automatic selection by the receiving device, or the like.

Information relating to a selective component group which is a componentgroup of a fixed selection target is designated as a selective componentgroup element. A selective component group element is an upper elementof a selectiveAttribute attribute and a composite component groupelement. However, the number of occurrences of a selective componentgroup element is n1 (n1 is an integer of 1 or greater).

The selectiveAttribute attribute indicates an attribute which isregulated in relation to a selective component group element, in whichm1 (m1 is an integer of 0 or greater) types of attributes of the fixedselection target are regulated. For example, in the selectiveAttributeattribute, individual attributes such as the view tag, the language, thereceiving device process requesting ability, the view title, and thepurpose are regulated as the attribute of the fixed selection target inthe selective layer described above.

Information relating to a composite component group which is a componentgroup which is a synthesis target is designated as a composite componentgroup element. A composite component group element is an upper elementof a compositeAttribute attribute and an adaptive component element.However, the number of occurrences of a composite component groupelement is n2 (n2 is an integer of 1 or greater).

A compositeAttribute attribute indicates an attribute which is regulatedin relation to a composite component group element, in which m2 (m2 isan integer of 0 or greater) types of attributes of a synthesis targetare regulated. For example, for a compositeAttribute attribute, anindividual attribute such as scalable, 3D, tile, layer, and mixing isregulated as the attribute of the synthesis target in the compositelayer described above. In addition, these attributes show types ofcombinations and it is possible to designate an attribute value whichshows an element in the combinations thereof.

Information relating to a component of an adaptive selection target isdesignated as an adaptive component element. An adaptive componentelement is an upper element of an adaptiveAttribute attribute and acomponent ID attribute. However, the number of occurrences of theadaptive component element is n3 (n3 is an integer of 1 or greater).

An adaptiveAttribute attribute shows an attribute which is regulated inrelation to an adaptive component element, in which m3 (m3 is an integerof 0 or greater) types of attributions of an adaptive selection targetare regulated. For example, for an adaptiveAttribute attribute, thetransfer path, the bit rate, or the like of a component is individuallyregulated as the attribute of the adaptive selection target in theadaptive layer described above. The ID of a component is designated as acomponent ID attribute.

Here, as described with reference to FIG. 6, a data configuration ofsignaling information for realizing the component layer structure shownin FIG. 4 and FIG. 5 is one example, and it is also possible to adoptanother data configuration. In addition, the signaling information isdescribed, for example, in a markup language such as an extensiblemarkup language (XML).

(2) Specific Examples of Component Layer Structure

Next, description will be given of specific examples of the componentlayer structure described above with reference to FIG. 7 to FIG. 9.

In a specific example of a component layer in FIG. 7, scalable isdesignated as the attribute of a synthesis target in a composite layerof a video, and in an adaptive layer which is a lower hierarchy thereof,a base stream is transferred in a broadcast (“scalable base” in thediagram) and an extended stream is transferred by communication(“scalable extended” in the diagram). Here, since only one broadcastvideo component of an adaptive selection target is prepared as the basestream, this one broadcast video component is selected every time. Onthe other hand, since a plurality of communication video components ofan adaptive selection target are prepared as the extended stream, anoptimum video component is adaptively selected from the plurality ofcommunication video components (“adaptive switching” in the diagram).

In the composite layer of the video, in order to supply a video with 4Kresolution, a video encoded signal (a video component) with 2Kresolution which is transferred in a broadcast and a video encodedsignal (a video component) which is equivalent to the difference between4K resolution and 2K resolution which is transferred by communicationare synthesized. Then, in a selective layer, for example, in a case inwhich a receiving device process requesting ability is designated as anattribute of the fixed selection target and the receiving device handles4K resolution, it is possible to display the synthesized video with 4Kresolution (“4K capable main view” in the diagram). On the other hand,in a case in which the receiving device does not handle 4K resolution, avideo with 2K resolution is displayed using only the base stream whichis transferred in a broadcast (“4K disable main view” in the diagram).That is, since the video component of the communication is not used inthis case, adaptive selection of the video component is not performed(“non switching” in the diagram).

Here, as shown in FIG. 8, the main view has the meaning of a maindisplay region on a display screen. The video with 4K resolution or 2Kresolution described above is displayed in the main view. In addition,it is possible to display a sub view which is an auxiliary displayregion with respect to the main display region as a screen example inFIG. 8. In the example of a component layer in FIG. 7, it is possible toselect a sub view 1 (“sub view 1” in the diagram) and a sub view 2 (“subview 2” in the diagram) as a selective component group in the selectivelayer.

In the sub view 1, since only one communication video component isprepared in an adaptive layer, adaptive selection is not performed andthis one communication video component is selected every time (“nonswitching” in the diagram). In addition, in the sub view 2, a pluralityof communication video components of an adaptive selection target areprepared in the adaptive layer, and successive optimum communicationvideo components are adaptively selected from a plurality ofcommunication video components (“adaptive switching” in the diagram).Here, in the screen example in FIG. 8, the sub view 1 and the sub view 2are simultaneously displayed in addition to the main view; however, asshown in the screen examples in FIGS. 9A to 9C, it is possible to allowthe user to select which view out of the main view, the sub view 1, andthe sub view 2 is to be displayed by the GUI screen or the like.

In the composite layer of the audio, scalable is designated as anattribute of a synthesis target thereof, and in an adaptive layer of alower hierarchy thereof, a stereo stream is transferred in a broadcastor by communication (“stereo” in the diagram) and a multi-channel streamis transferred by communication (“multi-channel dev” in the diagram).

Here, since a plurality of audio components which are the adaptiveselection target are prepared as a stereo stream, an optimum audiocomponent is adaptively selected from the plurality of broadcast orcommunication audio components (“adaptive switching” in the diagram).That is, by preparing each of an audio component with a normalrobustness (“normal robustness” in the diagram) and an audio componentwith a high robustness (“high robustness” in the diagram) as broadcastaudio components and making these able to be adaptively selected, forexample, an operation such as outputting only sound is possible in acase in which the audio component with a high robustness is selectedeven when a video may not be displayed on the receiving device for somereason. In addition, in a case in which a broadcast audio component maynot be received, a communication audio component may be selected.

On the other hand, since only one audio component which is an adaptiveselection target is prepared as a multi-channel stream, this onecommunication audio component is selected every time.

In an audio composite layer, a stereo audio component which istransferred in a broadcast and a multi-channel audio component which istransferred by communication are synthesized and a 22.2ch multi-channelsynthesized component is generated. Then, in the selective layer, forexample, in a case in which a receiving device process requestingability is designated as an attribute of the fixed selection target andthe receiving device handles a 22.2ch multi-channel, it is possible tooutput sound of the synthesized 22.2ch multi-channel (“22.2ch capablemain view” in the diagram). On the other hand, in a case in which thereceiving device does not handle a multi-channel of 22.2ch, only astereo stream which is transferred in a broadcast or by communication isused and stereo sound is output (“22.2ch disable main view” in thediagram).

Here, since view tag 1 is given as the attribute of the fixed selectiontarget of the audio selective layer thereof, the fixed selection targetis synchronized with a video component of the selective layer of thevideo where the view tag 1 is given in the same manner. In other words,the sound which corresponds to the audio component thereof is outputwith respect to a video which is displayed in the main view of thescreen example in FIG. 8.

In addition, as the attribute of the fixed selection target of theselective layer, an audio component to which view tag 2 is given issynchronized with the video component of the selective layer of thevideo to which view tag 2 is given in the same manner (“sub view 1” inthe diagram). In other words, the sound which corresponds to the audiocomponent is output with respect to the video which is displayed in thesub view 1 of the screen example in FIG. 8.

Furthermore, an audio component to which view tag 3 is given issynchronized with the video component to which view tag 3 is given inthe same manner (“sub view 2” in the diagram). Here, in the sub view 1and the sub view 2 of the audio, since only one communication audiocomponent is prepared in the adaptive layer, adaptive selection is notperformed and this one communication audio component is selected everytime.

In addition, as shown in FIG. 7, since the synthesis of the subtitlescomponent is not performed in the subtitles composite layer and,furthermore, adaptive selection of the subtitles component is notperformed in the adaptive layer, the subtitles component in theselective layer and the subtitles component in the adaptive layercorrespond one to one. Here, out of the subtitles components, only onesubtitles component on the leftmost side in the diagram is transferredin a broadcast and the other subtitles components are transferred bycommunication.

A subtitles component to which view tag 1 is given as the attribute ofthe fixed selection target of the selective layer is synchronized withcomponents of the video and the audio to which view tag 1 is given inthe same manner. In detail, in the case of the example, English andSpanish subtitles are supplied and more detailed and expositorysubtitles (“Eng(Ex)” and “Spa(Ex)” in the diagram) are prepared in thesubtitles in addition to the subtitles in the main part (“Eng(Nor)” and“Spa(Nor)” in the diagram). In the selective layer of the subtitles, forexample, in a case in which language is designated as an attribute ofthe fixed selection target by a user selection, it is possible todisplay subtitles according to the language code or the like. That is,subtitles such as English or Spanish which are selected by the user aresuperimposed and displayed on a video which is displayed on a main viewof the screen example in FIG. 8.

In addition, a subtitles component to which the view tag 2 is given asan attribute of a fixed selection target of a selective layer issynchronized with components of the video and the audio to which theview tag 2 is given in the same manner. In detail, since Englishsubtitles (“Eng” in the diagram) and Spanish subtitles (“Spa” in thediagram) are prepared, it is possible to superimpose and displaysubtitles according to a user selection on the video which is displayedin the sub view 1 of the screen example in FIG. 8.

Furthermore, a subtitles component to which the view tag 3 is given asan attribute of a fixed selection target of a selective layer issynchronized with components of the video and the audio to which theview tag 3 is given in the same manner. In detail, since the Englishsubtitles (“Eng” in the diagram) and the Spanish subtitles (“Spa” in thediagram) are prepared, it is possible to superimpose and displaysubtitles on the video according to a user selection.

<3. Configuration of Signaling Information> (1) Details of SignalingInformation

FIG. 10 is a diagram which shows details of signaling information.

As described above, for example, a service configuration description(SCD), an emergency alerting description (EAD), and a region ratingdescription (RRD) are transferred as an LLS.

The SCD adopts an ID system which is equivalent to the triplet which isused in an MPEG2-TS method and the BBP stream configuration and theservice configuration in a broadcast network are shown due to thetriplet. In addition, the SCD includes information such as an IP addressas attribute and setting information in a service unit, ESG bootstrapinformation for accessing an ESG, and SC bootstrap information foraccessing an SCS. The EAD includes information relating to an emergencynotification. The RRD includes rating information. Here, the SCD, theEAD, and the RRD are described, for example, by a markup language suchas an extensible markup language (XML).

For example, a service parameter description (SPD), a user servicedescription (USD), a media presentation description (MPD), and a sessiondescription protocol (SDP) are transferred as an SCS.

The SPD includes various types of parameters which are regulated by thelevel of the service and component. The USD includes information or thelike for referring to the MPD or the SDP. The MPD is information formanaging adaptive streaming distribution of a component and includesinformation of a segment uniform resource locator (URL) for eachcomponent which is transferred in a service unit.

The SDP includes a service attribute in a service unit, configurationinformation of a component, a component attribute, filter information ofa component, location information of a component, and the like. By usingthe USD, the MPD, and the SDP, it is possible to acquire a componentwhich is transferred, for example, in an RTP session or a FLUTE session.Here, an SPD, a USD, an MPD, and an SDP are described, for example, by amarkup language such as XML.

The ESG is an electronic service guide which includes, for example,information such as a program title or a starting time. It is possibleto acquire the ESG by using ESG bootstrap information of the SCD. Anapplication is configured by a file in hypertext markup language (HTML)format or the like and is distributed, for example, from a server on theinternet. The application is executed in synchronization with broadcastcontent such as a television program which is supplied as a specificservice. It is possible to correlate the ESG and the application withthe USD.

(2) Component Layer Correspondence Using MPD

FIG. 11 is a diagram which shows component layer correspondence using anMPD.

For the MPD, a period element, an adaptation set element, arepresentation element, and a sub representation element are describedin a hierarchy structure. The period element is a unit which describesthe configuration of content such as a television program. In addition,the adaptation set element, the representation element, or the subrepresentation element are used for each stream such as the video, theaudio, or the subtitles, and are able to describe the attributes of eachstream.

In detail, the adaptation set element represents streams which areencoded from various types of sources. Then, in order to select thestream on the receiving device side according to, for example, aparametric such as a bit rate, a representation element is arranged inan adaptation set element and, for example, streams which are aplurality of choices where parameters such as the bit rate are differentare listed. Typically, an adaptation set element or a representationelement correspond with a single stream such as a stream of video,audio, or subtitles.

Here, it is possible to realize the function of the level 3 layer inFIG. 11 by mapping a pick one component in an MPD adaptation set elementand further mapping a representation element or a sub representationelement which are listed in an adaptation set element in an elementarycomponent.

In addition, it is possible to realize the function of the level 2 layerin FIG. 11 by mapping in a dependence relationship description attributewhich defines a relationship between MPD adaptation set elements, arelationship between representation elements, or a relationship betweensub representation elements. As the dependence relationship descriptionattribute, it is possible to group an adaptation set attribute group bydefining a new element by using an essential property element or asupplemental property element which is a lower element of an adaptationset element which is regulated by the MPD.

Furthermore, it is possible to realize the function of the level 1 layerin FIG. 11 by mapping in a group attribute of an MPD adaptation setelement. The group attribute of the adaptation set element is forgrouping adaptation set elements and adaptation set elements which havethe same attribute value belong to the same group. Then, one adaptationset element is selected from a plurality of adaptation set elements inthe same group.

(Specific Operative Examples 1)

FIG. 12 is a diagram which shows a specific operative example ofcomponent layer correspondence using an MPD.

Description content of the MPD for realizing the operative example isshown in FIG. 12 and “AS” indicates an adaptation set element and “R”indicates a representation element. In addition, an ID attribute shownby “@id” and a group attribute shown by “@gid” are described in theadaptation set element. Furthermore, “@schemeIdUri” represents aschemeIdUri attribute of an essential property element.

Four adaptation set elements (id=“11”, “12”, “13”, and “14”) whichbelong to a group 1 of a video and four adaptation set elements(id=“21”, “22”, “23”, and “24”) which belong to a group 2 of an audioare described in the MPD in FIG. 12 and one or a plurality ofrepresentation elements are described in a subordinate of each of theadaptation set elements.

In the group 1 of the video, “base”, “ext”, “SV1”, and “SV2” are listedas video components in each representation element. Here, “base”represents a video component which is equivalent to a basic video signalwhich is able to be played back independently and “ext” represents avideo component which is equivalent to a video signal for extension. Inaddition, in “SV1” and “SV2”, “SV” represents a sub view which is anauxiliary region with respect to the main view which is the main displayregion.

Only one representation element is described in a subordinate of theadaptation set element with id=“11” and one video component which isequivalent to a basic video signal which is able to be played backindependently is selected every time. The video component which isselected in this manner is equivalent to an elementary component. Here,the video component is for the main view.

“urn: . . . :SVC” is designated as an attribute value of a schemeldUriattribute of an essential property element in an adaptation set elementwith id=“12”. Here, in the operative example in FIG. 12, “urn: . . .:SVC” is defined as an attribute value of a schemeldUri attribute of anessential property element, and an adaptation set attribute which has anessential property element which has the attribute value is defined soas to have a meaning that representation element groups in a subordinatethereof have a dependence relationship of a base-enhance relationshipwith each other.

Accordingly, four representation elements in a subordinate of anadaptation set element with id=“12” have a dependence relationship of abase-enhance relationship. In other words, four representation elementsare described in a subordinate of an adaptation set element withid=“12”, one video component which is equivalent to a basic video signalis listed, three video components which are equivalent to a video signalfor extension are listed, and there is a dependence relationship of abase-enhance relationship. Due to this, in the level 3 (a pick onecomponent) layer, for example, a video component which is equivalent toa basic video signal and a video signal for extension is selectedaccording to network environment conditions or the like. Here, the videocomponent is for the main view.

In this manner, in the MPD, an essential property element is used inorder to express features of a component group which is listed in arepresentation element in a subordinate of an adaptation set attribute.

Only one representation element is described in a subordinate of anadaptation set element with id=“13” and one video component for the subview 1 is selected every time. The video component which is selected inthis manner is equivalent to an elementary component.

Three representation elements are described in a subordinate of anadaptation set element with id=“14” and three video components for thesub view 2 are listed and in the level 3 (a pick one component) layer,for example, one video component is adaptively selected according tonetwork environment conditions or the like.

In this manner, by mapping a pick one component in an adaptation setelement and further mapping a representation element which is listed inan adaptation set element in an elementary component, the function ofthe level 3 layer in the component layer structure in FIG. 11 isrealized and selection of the video component is performed.

Here, in the MPD in FIG. 12, group=“1” is designated as a groupattribute in an adaptation set element with id=“11”, an adaptation setelement with id=“12”, an adaptation set element with id=“13”, and anadaptation set element with id=“14” and the adaptation set elementsbelong to the same group 1.

In this manner, by performing grouping according to the group attribute,the function of the level 1 layer in the component layer structure inFIG. 11 is realized and one video component is selected from videocomponents in the same group in the level 1 (a pick one component)layer. Here, one video component is selected from the video componentsand elementary components which are selected in the level 3 (a pick onecomponent) layer which belongs to the group 1.

On the other hand, in the group 2 of the audio, “NrRbst”, “HiRbst”,“MCD”, “SV1”, and “SV2” are listed as audio components in eachrepresentation element. “NrRbst” has a meaning of an audio componentwith a normal robustness. In addition, “HiRbst” has a meaning of anaudio component with a high robustness. “MCD” indicates a multi-channelaudio component. Furthermore, in “SV1” and “SV2”, “SV” indicates anaudio component for a sub view.

Two representation elements are described and an audio component whichhas a normal robustness and an audio component which has a highrobustness are listed in a subordinate of an adaptation set element withid=“21” and in the level 3 (a pick one component) layer, for example,one audio component is adaptively selected according to networkenvironment conditions or the like. Here, the audio component is for themain view.

Only one representation element is described in a subordinate of anadaptation set element with id=“22” and one multi-channel audiocomponent is selected every time. An audio component which is selectedin this manner is equivalent to an elementary component. Here, the audiocomponent is for the main view.

Only one representation element is described in a subordinate of anadaptation set element with id=“23” and one audio component for the subview 1 is selected every time. In the same manner, one audio componentfor the sub view 2 is selected every time in a subordinate of anadaptation set element with id=“24”. An audio component for the sub viewwhich is selected in this manner is equivalent to an elementarycomponent.

Here, in the MPD in FIG. 12, group=“2” is designated as a groupattribute in an adaptation set element with id=“21”, an adaptation setelement with id=“22”, an adaptation set element with id=“23”, and anadaptation set element with id=“24” and the adaptation set elementsbelong to the same group 2.

In this manner, by performing grouping according to the group attribute,the function of the level 1 layer in the component layer structure inFIG. 11 is realized, and one audio component is selected from audiocomponents in the same group in the level 1 (a pick one component)layer. Here, one audio component is selected from audio components andelementary components which are selected in the level 3 (a pick onecomponent) layer which belongs to the group 2.

In addition, in the MPD in FIG. 12, a subset element is used in order todefine grouping for simultaneous presentation of a plurality ofcomponents such as a video or an audio outside the limits of regulationsrelating to a component layer structure.

In detail, “11 21” is designated as a contains attribute of a firstsubset element, which shows that an adaptation set element with id=“11”and an adaptation set element with id=“21” are a combination ofadaptation set elements which are simultaneously presented. That is, avideo component (“V-base”) and an audio component (“A-NrRbst” or“A-HiRbst”) are components for the main view which are simultaneouslyplayed back.

“11 22” is designated as a contains attribute of a second subsetelement, which shows that an adaptation set element with id=“11” and anadaptation set element with id=“22” are a combination of adaptation setelements which are simultaneously presented. That is, a video component(“V-base”) and an audio component (“A-MCD”) are components for the mainview which are simultaneously played back.

“11 12 21” is designated as a contains attribute of a third subsetelement, which shows that an adaptation set element with id=“11”, anadaptation set element with id=“12”, and an adaptation set element withid=“21” are a combination of adaptation set elements which aresimultaneously presented. That is, a video component (“V-base” or“V-base” and “V-ext”) and an audio component (“A-NrRbst” or “A-HiRbst”)are components for the main view which are simultaneously played back.

“11 12 22” is designated as a contains attribute of a fourth subsetelement, which shows that an adaptation set element with id=“11”, anadaptation set element with id=“12”, and an adaptation set element withid=“22” are a combination of adaptation set elements which aresimultaneously presented. That is, a video component (“V-base” or“V-base” and “V-ext”) and an audio component (“A-MCD”) are componentsfor the main view which are simultaneously played back.

“13 23” is designated as a contains attribute of a fifth subset element,which shows that an adaptation set element with id=“13” and anadaptation set element with id=“23” are a combination of adaptation setelements which are simultaneously presented. That is, a video component(“V-SV1”) and an audio component (“A-SV1”) are components for the subview 1 which are simultaneously played back.

“14 24” is designated as a contains attribute of a sixth subset element,which shows that an adaptation set element with id=“14” and anadaptation set element with id=“24” are a combination of adaptation setelements which are simultaneously presented. That is, a video component(“V-SV2”) and an audio component (“A-SV2”) are components for the subview 2 which are simultaneously played back.

In this manner, by using a subset element in order to define groupingfor simultaneous presentation of a plurality of components such as videoor audio outside the limits of regulations relating to a component layerstructure in FIG. 11, for example, it is possible to correlate for themain view, for the sub view 1, for the sub view 2, or the like between avideo and an audio. In addition, for example, when one component out ofcomponents of a video and an audio is specified, the other component isalso specified.

Above, description was given of specific operative examples.

(3) Coordination of SPD and MPD

FIG. 13 is a diagram which shows a method for coordinating SPD and MPD.

As shown in FIG. 13, a component, a component group, a componentcategory, and a service form a hierarchy structure in an SPD. Inaddition, a representation element, an adaptation set element, and agroup form a hierarchy structure in an MPD. However, as described above,an MPD group is equivalent to a group attribute of an adaptation setelement.

In such a hierarchy structure, a component of an SPD and arepresentation element of an MPD are mapped by a representation ID.Furthermore, a component group of an SPD and an adaptation set elementof an MPD are mapped by an adaptation set ID. In addition, a componentcategory of an SPD and a group attribute of an adaptation set element ofan MPD are mapped by a group ID.

In this manner, since it is possible to perform an operation where anSPD and an MPD are mutually coordinated by mapping in each hierarchy, itis possible to process components (objects) which are included in eachlayer in a cross-sectional manner by sharing parameters of each layer ofthe SPD and the MPD. Here, the example of the hierarchy structure inFIG. 13 shows that a hierarchy which corresponds to a component of anSPD is an adaptive layer and a composite layer, and that a hierarchywhich corresponds to a component group of an SPD is a selective layer;however, these relationships are examples and, for example, it may beunderstood that a hierarchy which corresponds to a component of an SPDis an adaptive layer, a hierarchy which corresponds to a component groupof an SPD is a composite layer, and a hierarchy which corresponds to acomponent category of an SPD is a selective layer.

Specific Operative Examples 2

FIG. 14 is a diagram which shows a specific operative example of acoordination of SPD and MPD.

In FIG. 14, description content of an SPD is shown on the left side anda description content of an MPD is shown on the right side. In addition,in the same manner as FIG. 4 and the like described above, components ofvideo and audio are configured by three hierarchies of a selectivelayer, a composite layer, and an adaptive layer.

In the adaptive layer of the SPD, “C” has a meaning of a component, “V”has a meaning of video, and “A” has a meaning of audio. In addition,“RTP” shows that a component is transferred in an RTP session, that is,an RTP stream is transferred in a broadcast. In addition, “DASH” showsthat a component is distributed in streaming which conforms to standardsof a moving picture expert group—dynamic adaptive streaming over HTTP(MPEG-DASH), that is, a DASH stream is transferred by communication.

In detail, a video component C1 represented by “C1” in the diagram istransferred as an RTP stream in a broadcast. On the other hand, videocomponents C2 to C5 represented by “C2” to “C5” in the diagram aretransferred as a DASH stream by communication. Here, the bit rate of thevideo component C2 is, for example, a low bit rate such as 1 Mbps. Onthe other hand, the bit rate of the video component C3 is, for example,a high bit rate such as 10 Mbps.

In addition, an audio component C6 represented by “C6” in the diagram istransferred as an RTP stream in a broadcast. An audio component C7 andan audio component C8 represented by “C7” and “C8” in the diagram aretransferred as a DASH stream by communication.

On the other hand, in an adaptive layer of an MPD, “R” in the adaptivelayer has a meaning of a representation element of an MPD. That is,video components R1 to R4 which are represented by “R1” to “R4” in thediagram and which are described in a representation element aretransferred as a DASH stream by communication. Here, the bit rate of thevideo component R1 is, for example, a low bit rate such as 1 Mbps. Onthe other hand, the bit rate of the video component R2 is, for example,a high bit rate such as 10 Mbps.

In addition, an audio component R5 and an audio component R6 representedby “R5” and “R6” in the diagram are transferred as a DASH stream bycommunication.

When adaptive layers of the SPD and the MPD formed by the configurationdescribed above are compared, since the MPD is regulated in order tomanage a streaming distribution which conforms to standards ofMPEG-DASH, description is basically only given of a components which aretransferred by communication as a DASH stream. On the other hand, sincedescription relating to the components is not limited according to thetransfer form in the SPD, it is possible to describe both componentswhich are transferred in a broadcast as an RTP stream and componentswhich are transferred by communication as a DASH stream.

In other words, in the description examples of the SPD and the MPD inFIG. 14, a component which is transferred as a DASH stream bycommunication which is the same as description content of the MPD isdescribed in the SPD, in addition to a component which is transferred asan RTP stream in a broadcast. That is, the components which aredescribed in the MPD are all described in the SPD. Then, arepresentation ID is designated for each component in the SPD and sincethe representation ID correlates with a representation ID of arepresentation element of the MPD, it is possible to associatecommunication components which are described in the SPD and the MPD.

That is, in the adaptive layer in FIG. 14, as shown by an arrow in thediagram, video components C2, C3, C4, and C5 of the SPD and videocomponents R1, R2, R3, and R4 of the MPD are associated according to arepresentation ID. In addition, as shown by an arrow in the diagram, theaudio components C7 and C8 of the SPD and audio components R5 and R6 ofthe MPD are associated according to a representation ID.

In addition, in the composite layer of the SPD, “CG” has a meaning of acomponent group, “V” has a meaning of video, and “A” has a meaning ofaudio. That is, video components C1 to C3 in a component group CG1represented by “CG1-V” in the diagram are synthesized and form asynthesized component.

Here, a scalable attribute is designated as an attribute of thesynthesis target in the composite layer. That is, “Base” is designatedas an attribute value of the scalable attribute in the video componentC1 and “Extended” is designated as an attribute value of the scalableattribute in the video component C2 and the video component C3. Inaddition, the bit rate of the video component C2 is, for example, a lowbit rate such as 1 Mbps. On the other hand, the bit rate of the videocomponent C3 is, for example, a high bit rate such as 10 Mbps.

Due to this, the video component C1, which is transferred in a broadcastby an RTP stream as a base stream, and the video component C2 or thevideo component C3, which is transferred by communication by a DASHstream as an extended stream, are synthesized and a synthesizedcomponent is obtained in the component group CG1. Here, a communicationvideo component of one of the video component C2 and the video componentC3 which are transferred as a DASH stream by communication and whichhave different bit rates is selected, for example, according to thecongestion status of a communication path.

In addition, since only one communication video component C4 is preparedin a component group CG2 represented by “CG2-V” in the diagram, adaptiveselection is not performed in the adaptive layer and in addition,synthesis is also not performed in the composite layer. Furthermore,since only one communication video component C5 is prepared in acomponent group CG3 represented by “CG3-V” in the diagram, adaptiveselection is not performed in the adaptive layer and in addition,synthesis is also not performed in the composite layer.

Furthermore, since only one broadcast audio component C6 is prepared ina component group CG4 represented by “CG4-A” in the diagram, adaptiveselection is not performed in the adaptive layer and in addition,synthesis is also not performed in the composite layer. In the samemanner, in addition, a communication audio component C7 is selectedevery time with regard to a component group CG5 represented by “CG5-A”in the diagram, and the communication audio component C8 is selectedevery time with regard to a component group CG6 represented by “CG6-A”.

On the other hand, in a composite layer of the MPD, “AS” has a meaningof an adaptation set element of an MPD. That is, in an adaptation setAS1 represented by “AS1-V” in the diagram, a communication videocomponent of one of a video component R1 and a video component R2 whichhave different bit rates is selected, for example, according to thecongestion status of a communication path.

In addition, since only one communication video component R3 is preparedin an adaptation set AS2 represented by “AS2-V” in the diagram, thevideo component R3 is selected every time. In the same manner, inaddition, since only one communication video component R4 is prepared inan adaptation set AS3 represented by “AS3-V” in the diagram, the videocomponent R4 is selected every time.

Furthermore, since only one communication audio component R5 is preparedin an adaptation set AS4 represented by “AS4-A” in the diagram, theaudio component R5 is selected every time. In the same manner, inaddition, since only one communication audio component R6 is prepared inan adaptation set AS5 represented by “AS5-A” in the diagram, the audiocomponent R6 is selected every time.

When composite layers of the SPD and the MPD formed by the configurationdescribed above are compared, while only communication components aredescribed in an adaptation set element of the MPD, a component group forbroadcast components is described in an SPD in addition to communicationcomponents. Then, an adaptation set ID is designated for each componentgroup in the SPD and since the adaptation set ID correlates with anadaptation set ID of an adaptation set element of the MPD, it ispossible to correlate the component groups of the SPD and the adaptationset element of the MPD.

That is, in the composite layer in FIG. 14, as shown by an arrow in thediagram, component groups CG1, CG2, and CG3 of the SPD and adaptationsets AS1, AS2, and AS3 of the MPD are associated according to anadaptation set ID. In addition, as shown by an arrow in the diagram,component groups CG5 and CG6 of an SPD and adaptation sets AS4 and AS5of an MPD are associated according to an adaptation set ID.

In addition, in a selective layer of the SPD, “CC” has a meaning of acomponent category, “V” has a meaning of video, and “A” has a meaning ofaudio. That is, in a component category CC1 represented by “CC1-V” inthe diagram, any one video component of the component groups CG1 to CG3is selected. In addition, in a component category CC2 represented by“CC2-A” in the diagram, any one audio component of the component groupsCG4 to CG6 is selected.

On the other hand, in a selective layer of the MPD, “group” has ameaning of a group. That is, in a group g1 represented by “group1-V” inthe diagram, any one communication video component out of the adaptationsets AS1 to AS3 is selected. In addition, in a group g2 represented by“group2-A”, any one communication audio component out of the adaptationsets AS4 and AS5 is selected.

Then, a group ID is designated for each component category in the SPDand since the group ID correlates with a group ID of the MPD, it ispossible to correlate the component categories of the SPD and a groupattribute of an adaptation set element of the MPD.

That is, in the selective layer in FIG. 14, as shown by an arrow in thediagram, the component category CC1 of the SPD and the group g1 of theMPD are associated according to a group ID. In addition, as shown by anarrow in the diagram, the component category CC2 of the SPD and thegroup g2 of the MPD are associated according to a group ID.

Above, as shown in the operative example in FIG. 14, a component of theSPD and a representation element of the MPD are mapped by arepresentation ID, a component group of the SPD and an adaptation setelement of the MPD are mapped by an adaptation set ID, and a componentcategory of the SPD and a group attribute of an adaptation set elementof the MPD are mapped by a group ID. Due to this, since it is possibleto perform an operation where the SPD and the MPD are mutuallycoordinated, it is possible to process components (objects) which areincluded in each layer in a cross-sectional manner by sharing parametersof each layer of the SPD and the MPD.

For example, in a case of the operative example in FIG. 14, since it isonly possible to describe a communication component when with only anMPD, in the adaptation set AS1, it is only possible to express that oneof the video component R1 and the video component R2 which havedifferent bit rates is selected; however, by referring to a coordinatedSPD, it is possible to recognize not only that one of the videocomponent R1 (C2) and the video component R2 (C3) is selected but alsothat the selected communication video component is synthesized with thebroadcast video component C1.

In addition, since the MPD is regulated in order to manage a streamingdistribution which conforms to standards of an MPEG-DASH, description isonly given of a communication component; however, by using the SPD, itis possible to describe a broadcast component in addition to acommunication component. For example, in a case of using the MPD as themain, it is possible to acquire the information which is lacking as acomplement by referring to the SPD. Here, since it is possible todescribe both broadcast components and communication components in theSPD, the SPD may be used alone. In addition, since it is possible todescribe communication components in the MPD, signaling informationwhere a broadcast component and a communication component are describedmay be split by describing only broadcast components in the SPD.

(4) Syntax (Syntax of SPD)

FIG. 15 is a diagram which shows syntax of the SPD. An SPD is described,for example, in a markup language such as an XML. Here, in the elementsand attributes in FIG. 15, “@” denotes the attributes. In addition, theelements and attributes which are indented are designated with respectto the upper elements thereof. In addition, the relationship between theelements and the attributes is the same in another syntax which will bedescribed below.

As shown in FIG. 15, an Spd element is an upper element of a service IDattribute, a sp indicator attribute, a component layer descriptorelement, a protocol version descriptor element, an NRT servicedescriptor element, a capability descriptor element, an icon descriptorelement, an ISO 639 language descriptor element, a receiver targetingdescriptor element, an associated service descriptor element, a contentadvisory descriptor element, and a component element.

A service ID is designated as a service ID attribute. Whether or notencryption is carried out for each service which is distinguished by aservice ID is designated as a sp indicator attribute. The service isshown to be encrypted in a case in which “on” is designated as the spindicator attribute and the service is shown to not be encrypted in acase in which “off” is designated.

Information relating to the component layer structure is designated asthe component layer descriptor element. Information for indicating whatkind of service is a data service is designated as the protocol versiondescriptor element.

Information relating to an NRT service is designated as the NRT servicedescriptor element. Information relating to a function (capability)which is requested for the receiving device which receives the provisionof the NRT service is designated as the capability descriptor element.

Information which indicates an acquisition destination of an icon whichis used in the NRT service is designated as the icon descriptor element.A language code of the NRT service is designated as the ISO 639 languagedescriptor element. Target information of the NRT service is designatedas the receiver targeting descriptor element.

Information relating to an associated subordination service isdesignated as an associated service descriptor element. Informationrelating to a rating region is designated as the content advisorydescriptor.

Various types of parameters are regulated in a service level by thedescriptor elements described above in the SPD. In addition, varioustypes of parameters are regulated in a component level by componentelements below.

A component element is an upper element of a component ID attribute, arepresentation ID attribute, a sub representation level attribute, acomponent category attribute, a location type attribute, a componentencryption attribute, a composite position attribute, a targeted screenattribute, a content advisory descriptor element, an AVC videodescriptor element, a HEVC video descriptor element, an MPEG4 AAC audiodescriptor element, an AC3 audio descriptor element, and a captiondescriptor element.

A component ID is designated as the component ID attribute. Arepresentation ID of a corresponding MPD is designated as arepresentation ID attribute. A component of the SPD and a representationelement of the MPD are associated according to a representation ID.

A sub representation level is designated as a sub representation levelattribute. For example, in a case in which components of a plurality ofcategories (for example, a video or an audio) are stored in each segmentin a FLUTE session, the sub representation level is information fordistinguishing the components.

Category information of a component is designated as the componentcategory attribute. For example, “video”, “audio”, “caption”, and “nrt”are designated as the category information. “Video” indicates a videocomponent, “audio” indicates an audio component, and “caption” indicatesa subtitles component. In addition, “nrt” indicates data of NRT content.

Type information of a location of a component is designated as thelocation type attribute. For example, “bb”, “bca”, and “bco” aredesignated as the type information. “bb” is an abbreviation of broadbandand indicates that the component is distributed using communication.“bca” is an abbreviation of broadcast actual and indicates that acomponent is distributed using a broadcast and is distributed in thesame service as the service where the SPD (SCS) is transferred. “bco” isan abbreviation of broadcast other and indicates that a component isdistributed using a broadcast and is distributed in another servicewhich is different from the service where the SPD (SCS) is transferred.

Whether or not encryption is carried out for each component which isdistinguished by a component ID is designated as a component encryptionattribute. The component is shown to be encrypted in a case in which“on” is designated as a component encryption attribute and the componentis shown to not be encrypted in a case in which “off” is designated.

Information relating to the synthesis of components which is performedin a composite layer is designated as the composite position attribute.Information which corresponds to the attribute value of the compositetype attribute of a component group element of the component layerdescriptor (FIG. 16) is designated here. For example, in a case in whicha scalable attribute is designated as the composite type attribute ofthe component group element, “base” is designated as a compositeposition attribute when the target component is a base stream. Inaddition, “enhancement” is designated as a composite position attributewhen the target component is an extended stream.

Information relating to a display of a target device in a component unitis designated as the targeted screen attribute. For example, “primary”and “secondary” are designated as information relating to the display.“Primary” is designated in a case in which a video is displayed, forexample, on a television set as a first display device. “Secondary” isdesignated in a case in which a video is displayed, for example, on atablet terminal apparatus as a second display device. Rating informationin a component unit is designated as the content advisory descriptorelement.

In a case in which advanced video coding (AVC) is used as a method forencoding video data, video parameters are designated as the AVC videodescriptor element. In addition, in a case in which high efficiencyvideo coding (HEVC) is used as the method for encoding the video data,video parameters are designated as the HEVC video descriptor element.Here, the AVC and the HEVC are examples of a method for encoding videodata, and in a case in which another encoding method is used, acorresponding video descriptor element is designated.

In a case in which MPEG4 advanced audio coding (AAC) is used as a methodfor encoding audio data, audio parameters are designated as the MPEG4AAC audio descriptor element. In addition, in a case in which an audiocode number 3 (AC3) is used as the method for encoding audio data, audioparameters are designated as the AC3 audio descriptor element. Here, theMPEG4 AAC and the AC3 are examples of a method for encoding audio data,and in a case in which another encoding method is used, a correspondingaudio descriptor element is designated. A subtitles parameter isdesignated as the caption descriptor element.

Here, in the syntax of the SPD in FIG. 15, a protocol version descriptorelement, an NRT service descriptor element, a capability descriptorelement, an icon descriptor element, an ISO 639 language descriptorelement, and a receiver targeting descriptor element are regulated foran NRT service.

In addition, regarding the number of occurrences (cardinality) ofelements and attributes of the SPD shown in FIG. 15, only one of theelement and the attribute is constantly designated in a case in which“1” is designated, and whether or not to designate the element or theattribute is arbitrary in a case in which “0 . . . 1” is designated. Inaddition, one or more of the elements and the attributes are designatedin a case in which “1 . . . n” is designated and whether or not todesignate one or more of the elements or the attributes is arbitrary ina case in which “0 . . . n” is designated. The meaning of the number ofoccurrences is the same as in another syntax which will be describedbelow.

Next, description will be given of a specific configuration ofdescriptor elements which are described in an SPD in FIG. 15. Here, outof descriptor elements which are described in the SPD, description willbe given of a component layer descriptor element as a representative.

(Component Layer Descriptor)

FIG. 16 is a diagram which shows syntax of a component layer descriptor.The component layer descriptor is described, for example, in a markuplanguage such as an XML.

As shown in FIG. 16, information relating to a component layerconfiguration is designated as a component layer descriptor element. Acomponent layer descriptor element is an upper element of a componentcategory element. Information relating to a component category isdesignated as the component category element. The component categoryelement is an upper element of a category attribute, an mpd group IDattribute, and a component group element.

Category information (a title) of a component is designated as thecategory attribute. For example, “video”, “audio”, “caption”, and “nrt”are designated as the category information. “Video” indicates a videocomponent, “audio” indicates an audio component, and “caption” indicatesa subtitles component. In addition, “nrt” indicates data of NRT content.

A group ID of a corresponding MPD is designated as the mpd group IDattribute. Component categories of the SPD and a group attribute of anadaptation set element of the MPD are associated according to a groupID. Information relating to a component group is designated as acomponent group element.

A component group element is an upper element of an ID attribute, anadaptation set ID attribute, a default flag attribute, a mux IDattribute, an encryption attribute, a language attribute, a compositetype attribute, a usage attribute, a stereoscope attribute, an audiochannel config attribute, a target screen attribute, a viewpointdescriptor element, a content advisory descriptor element, and acomponent ID element.

A component group ID is designated as the ID attribute. An adaptationset ID of a corresponding MPD is designated as the adaptation set IDattribute. A component group of the SPD and an adaptation set element ofthe MPD are associated according to an adaptation set ID.

In a case in which a target component group is selected automatically(as default) when selecting a channel, “1” is designated as the defaultflag attribute. An ID of a combination between different componentcategories is designated as the mux ID attribute. For example, the samemux ID attribute is designated in the component group of the video andaudio in the main view. Here, for the ID, the same ID is designated withrespect to a combination of subset elements of the MPD. In a case inwhich a target component group is encrypted, “1” is designated as theencryption attribute. The language of a target component group isdesignated as the language attribute.

Type information of synthesis of components which is performed in acomposite layer is designated as the composite type attribute. Forexample, it is possible to designate scalable, mixing (mix), tile, andthe like as the type information. As described above, according to thetype information which is designated here, the value of a compositeposition attribute of a component element in the SPD is designated (FIG.15).

Information relating to the purpose of usage is designated as the usageattribute. For example, “vi”, “hi”, and “narration” are designated asthe information relating to the purpose of usage. “vi” indicates avisual disturbance, “hi” indicates an auditory disturbance, and“narration” indicates narration. In a case of 3D in the informationwhich indicates 3D or 2D, “1” is designated as the stereoscopeattribute. Information which indicates a configuration of a soundchannel is designated as the audio channel config attribute. “Monoral”,“Stereo”, “5.1ch”, and “22.1ch” are designated as the information whichindicates the configuration of the sound channel.

Information relating to a display of a target device in a componentgroup unit is designated as the target screen attribute. For example,“primary” and “secondary” are designated as the information relating tothe display. “Primary” is designated in a case in which a video isdisplayed, for example, on a television set as a first display device.“Secondary” is designated in a case in which a video is displayed, forexample, on a tablet terminal apparatus as a second display device.Information relating to a view point is designated as a view pointdescriptor element. An ID and a title for each view point are designatedas the information relating to the view point. Rating information in acomponent group unit is designated as the content advisory descriptorelement. A component ID of a component which is included in a targetcomponent group is designated as the component ID element.

(Another Configuration of Syntax of SPD)

Here, in the syntax of the SPD in FIG. 15 described above, the componentlayer descriptor in FIG. 16 is arranged as a descriptor in a servicelevel such that various types of parameters are regulated in two stagesof a service level and a component level; however, description contentof the component layer descriptor may be directly described in the SPD.Syntax of the SPD is shown in FIG. 17. Here, in a case in which elementsor attributes which configure the SPD in FIG. 17 have a meaning of thesame content as the elements or attributes which configure the SPD inFIG. 15, and description thereof will be omitted since the descriptionis a repetition.

As shown in FIG. 17, the Spd element is an upper element of a service IDattribute, an sp indicator attribute, a protocol version descriptorelement, an NRT service descriptor element, a capability descriptorelement, an icon descriptor element, an ISO 639 language descriptorelement, a receiver targeting descriptor element, an associated servicedescriptor element, a content advisory descriptor element, and acomponent category group element.

Information relating to a component category and a component group isdesignated as the component category group element. The componentcategory group element is an upper element of a component categoryattribute, an mpd group ID attribute, and a component group element.

Category information of a component is designated as the componentcategory attribute. For example, “video”, “audio”, “caption”, and “nrt”are designated as the category information. “Video” indicates a videocomponent, “audio” indicates an audio component, and “caption” indicatesa subtitles component. In addition, “nrt” indicates data of NRT content.

A group ID of a corresponding MPD is designated as the mpd group IDattribute. Component categories of the SPD and a group attribute of anadaptation set element of the MPD are associated according to a groupID. Information relating to a component group is designated as thecomponent group element.

A component group element is an upper element of an ID attribute, anadaptation set ID attribute, a default flag attribute, a mux IDattribute, an encryption attribute, a language attribute, a compositetype attribute, a usage attribute, a stereoscope attribute, an audiochannel config attribute, a target screen attribute, a viewpointdescriptor element, a content advisory descriptor element, and acomponent element. An adaptation set ID of a corresponding MPD isdesignated as the adaptation set ID attribute. A component group of theSPD and an adaptation set element of the MPD are associated according toan adaptation set ID.

A component element is an upper element of a component ID attribute, arepresentation ID attribute, a sub representation level attribute, acomponent category attribute, a location type attribute, a compositeposition attribute, an AVC video descriptor element, a HEVC videodescriptor element, an MPEG4 AAC audio descriptor element, an AC3 audiodescriptor element, and a caption descriptor element. A representationID of a corresponding MPD is designated as the representation IDattribute. A component of the SPD and a representation element of theMPD are associated according to a representation ID.

As described above, in the SPD in FIG. 17, it is possible to perform anoperation where the SPD and the MPD are mutually coordinated by mappingwith the MPD according to a group ID, an adaptation set ID, and arepresentation ID such that parameters in a component category level anda component group level are regulated in addition to two stages of aservice level and a component level. Here, the syntax of the SPDdescribed above is one example and another syntax may be adopted.

<4. System Configuration> (Configuration of Broadcast CommunicationSystem)

FIG. 18 is a diagram which shows a configuration of one embodiment of abroadcast communication system to which the present technique isapplied.

As shown in FIG. 18, a broadcast communication system 1 is configured bya data providing server 10, a transmitting apparatus 20, a distributionserver 30, and a receiving apparatus 40. In addition, the distributionserver 30 and the receiving apparatus 40 are mutually connected via anetwork 90 such as the internet.

The data providing server 10 supplies various types of components suchas video, audio, or subtitles to the transmitting apparatus 20 and thedistribution server 30. Here, for example, in order to realize anadaptive streaming distribution in a service which supplies a televisionprogram, as components which configure the television program, a videocomponent of 8 Mbps is supplied to the transmitting apparatus 20 andvideo components of 1 Mbps, 2 Mbps, 5 Mbps, 10 Mbps, and 20 Mbps aresupplied to the distribution server 30.

The transmitting apparatus 20 transmits various types of components (forexample, a video component of 8 Mbps) which are provided from the dataproviding server 10 by a broadcast wave of digital broadcasting. Inaddition, the transmitting apparatus 20 transmits a control signal(signaling information in FIG. 10) by a broadcast wave of digitalbroadcasting in addition to the components. Here, the control signal(signaling information in FIG. 10) may be distributed from a dedicatedserver or the like which is connected with the network 90.

The distribution server 30 distributes various types of components (forexample, video components of 1 Mbps, 2 Mbps, 5 Mbps, 10 Mbps, and 20Mbps) which are provided from the data providing server 10 to thereceiving apparatus 40 via the network 90 according to a request fromthe receiving apparatus 40.

The receiving apparatus 40 receives a broadcast signal which istransmitted from the transmitting apparatus 20 and acquires a controlsignal (signaling information in FIG. 10). The receiving apparatus 40acquires various types of components (for example, a video component of8 Mbps) such as video, audio, or subtitles which are transmitted fromthe transmitting apparatus 20 according to the control signal. Inaddition, the receiving apparatus 40 acquires various types ofcomponents (for example, video components of 1 Mbps, 2 Mbps, 5 Mbps, 10Mbps, and 20 Mbps) such as video, audio, or subtitles which aredistributed from the distribution server 30 according to the controlsignal.

The receiving apparatus 40 displays a video of components of a video orsubtitles on a display and outputs the sound of an audio component whichis synchronized with the video from a speaker. Here, for example, everyset period (for example, 10 seconds), an optimum video component isadaptively selected from a broadcast video component of 8 Mbps andcommunication video components of 1 Mbps, 2 Mbps, 5 Mbps, 10 Mbps, and20 Mbps, switched, and an adaptive streaming distribution is realized.

Here, the receiving apparatus 40 may be configured as a single bodyincluding a display or a speaker or may be incorporated into atelevision set, a video recorder, or the like.

The broadcast communication system 1 is configured as described above.Next, description will be given of the detailed configuration of eachapparatus which configures the broadcast communication system 1 in FIG.18.

(Configuration of Transmitting Apparatus)

FIG. 19 is a diagram which shows a configuration of one embodiment of atransmitting apparatus to which the present technique is applied.

As shown in FIG. 19, the transmitting apparatus 20 is configured by acomponent acquiring section 201, a control signal acquiring section 202,a Mux 203, and a transmitting section 204.

The component acquiring section 201 acquires various types of componentsfrom the data providing server 10 and supplies the components to the Mux203. The control signal acquiring section 202 acquires a control signal(signaling information in FIG. 10) from an external server such as thedata providing server 10 or an internal storage and supplies the controlsignal to the Mux 203.

The Mux 203 multiplexes various types of components from the componentacquiring section 201 and a control signal from the control signalacquiring section 202, generates a BBP stream, and supplies the BBPstream to the transmitting section 204. The transmitting section 204transmits the BBP stream which is supplied from the Mux 203 via anantenna 205 as a broadcast signal.

(Configuration of Distribution Server)

FIG. 20 is a diagram which shows a configuration of one embodiment of adistribution server to which the present technique is applied.

As shown in FIG. 20, the distribution server 30 is configured by acontrol section 301, a component acquiring section 302, an accumulationsection 303, and a communication section 304.

The control section 301 controls an operation of each section of thedistribution server 30.

The component acquiring section 302 acquires various types of componentsfrom the data providing server 10 and supplies the components to thecontrol section 301. The control section 301 accumulates various typesof components from the component acquiring section 302 in theaccumulation section 303. Due to this, various types of components fromthe data providing server 10 are accumulated in the accumulation section303.

The communication section 304 exchanges various types of data with thereceiving apparatus 40 via the network 90 under the control of thecontrol section 301. In a case in which the communication section 304receives a request for the distribution of a stream (a component) fromthe receiving apparatus 40, the control section 301 reads out acomponent which corresponds to the request from the accumulation section303. The control section 301 controls the communication section 304 anddistributes a stream formed of components which are read out from theaccumulation section 303 to the receiving apparatus 40 via the network90.

(Configuration of Receiving Apparatus)

FIG. 21 is a diagram which shows a configuration of one embodiment of areceiving apparatus to which the present technique is applied.

As shown in FIG. 21, the receiving apparatus 40 is configured by a tuner402, a Demux 403, a selection/synthesis section 404, aselection/synthesis section 405, a selection/synthesis section 406, acontrol section 407, an NVRAM 408, an input section 409, a communicationsection 410, a Demux 411, a video decoder 412, a video output section413, an audio decoder 414, an audio output section 415, and a subtitlesdecoder 416.

The tuner 402 extracts and demodulates a broadcast signal of a servicefor which channel selection is instructed from a broadcast signal whichis received by an antenna 401 and supplies a BBP stream which isobtained as a result to the Demux 403.

The Demux 403 separates the BBP stream which is supplied from the tuner402 into each component and a control signal, supplies each of thecomponents to the selection/synthesis sections 404 to 406, and suppliesthe control signal to the control section 407. Here, as components, avideo component, an audio component, and a subtitles component areseparated and are respectively supplied to the selection/synthesissection 404, the selection/synthesis section 405, and theselection/synthesis section 406.

The control section 407 controls an operation of each section of thereceiving apparatus 40. The NVRAM 408 is a non-volatile memory andrecords various types of data under the control of the control section407. In addition, the control section 407 controls a selection/synthesisprocess which is performed by the selection/synthesis sections 404 to406 based on a control signal (signaling information in FIG. 10) whichis supplied from the Demux 403.

The input section 409 supplies an operation signal to the controlsection 407 according to an operation of the user. The control section407 controls an operation of each section of the receiving apparatus 40based on the operation signal from the input section 409.

The communication section 410 exchanges various types of data with thedistribution server 30 via the network 90 under the control of thecontrol section 407. The communication section 410 supplies a streamwhich is received from the distribution server 30 to the Demux 411. Atthat time, the communication section 410 receives a stream which isdistributed from the distribution server 30 while buffering stream datain the receiving buffer 421 which is provided therein.

The Demux 411 separates a stream which is supplied from thecommunication section 410 into each of the components and supplies thecomponents to the selection/synthesis sections 404 to 406. Here, out ofthe components after the separation, a video component is supplied tothe selection/synthesis section 404, an audio component is supplied tothe selection/synthesis section 405, and a subtitles component issupplied to the selection/synthesis section 406.

The selection/synthesis section 404 performs a selection/synthesisprocess (for example, a process of each layer in the video componentlayer in FIG. 4) with respect to a video component from the Demux 403and a video component from the Demux 411 under the control of thecontrol section 407 and supplies a video component which is obtained asa result of the process to the video decoder 412.

The video decoder 412 decodes a video component which is supplied fromthe selection/synthesis section 404 and supplies video data which isobtained as a result to the video output section 413. The video outputsection 413 outputs the video data which is supplied from the videodecoder 412 to a display in a rear stage (which is not shown in thediagram). Due to this, for example, a video of a television program orthe like is displayed on a display.

The selection/synthesis section 405 performs a selection/synthesisprocess (for example, a process of each layer in the audio componentlayer in FIG. 4) with respect to an audio component from the Demux 403and an audio component from the Demux 411 under the control of thecontrol section 407 and supplies an audio component which is obtained asa result of the process to the audio decoder 414.

The audio decoder 414 decodes an audio component which is supplied fromthe selection/synthesis section 405 and supplies audio data which isobtained as a result to the audio output section 415. The audio outputsection 415 supplies the audio data which is supplied from the audiodecoder 414 to a speaker in a rear stage (which is not shown in thediagram). Due to this, for example, sound which corresponds to the videoof a television program is output from a speaker.

The selection/synthesis section 406 performs a selection/synthesisprocess (for example, a process of each layer in the subtitles componentlayer in FIG. 4) with respect to a subtitles component from the Demux403 and a subtitles component from the Demux 411 under the control ofthe control section 407 and supplies a subtitles component which isobtained as a result of the process to the subtitles decoder 416.

The subtitles decoder 416 decodes a subtitles component which issupplied from the selection/synthesis section 406 and supplies subtitlesdata which is obtained as a result to the video output section 413. In acase in which subtitles data is supplied from the subtitles decoder 416,the video output section 413 synthesizes the subtitles data with videodata from the video decoder 412 and supplies the result to a display ina rear stage (which is not shown in the diagram). Due to this, subtitlesare superimposed on the video of the television program and displayed onthe display.

Here, in the receiving apparatus 40 in FIG. 21, for convenience ofdescription, a configuration where the selection/synthesis sections 404to 406 are provided at a front stage of each decoder is shown; however,depending on the content of the selection/synthesis process, aconfiguration where the selection/synthesis sections 404 to 406 areprovided at a rear stage of each decoder may be adopted.

<5. Flow of Specific Processes Executed in Each Apparatus>

Next, description will be given of a flow of a process which is executedin each apparatus which configures the broadcast communication system 1in FIG. 18 with reference to the flowcharts of FIG. 22 to FIG. 24.

(Transmission Process)

Firstly, description will be given of a transmission process which isexecuted by the transmitting apparatus 20 in FIG. 18 with reference tothe flowchart of FIG. 22.

In step S201, the component acquiring section 201 acquires various typesof components from the data providing server 10 and supplies thecomponents to the Mux 203. In step S202, the control signal acquiringsection 202 acquires a control signal (signaling information in FIG. 10)from an external server or the like and supplies the control signal tothe Mux 203.

In step S203, the Mux 203 multiplexes various types of components fromthe component acquiring section 201 and a control signal from thecontrol signal acquiring section 202, generates a BBP stream, andsupplies the BBP stream to the transmitting section 204. In step S204,the transmitting section 204 transmits the BBP stream which is suppliedfrom the Mux 203 via the antenna 205 as a broadcast signal. When theprocess of step S204 ends, the transmission process in FIG. 22 ends.

Above, description was given of a transmission process. In thetransmission process, various types of components and a control signalwhich are supplied from the data providing server 10 are transmitted bya broadcast wave.

(Distribution Process)

Next, description will be given of a distribution process which isexecuted by the distribution server 30 in FIG. 18 with reference to theflowchart in FIG. 23. However, various types of components which areacquired from the data providing server 10 are accumulated in theaccumulation section 303 in the distribution server 30.

In step S301, the control section 301 constantly monitors thecommunication section 304 and determines whether a component isrequested from the receiving apparatus 40 via the network 90. In stepS301, the process waits for a request for a component from the receivingapparatus 40 and proceeds to step S302.

In step S302, the control section 301 reads out a component from theaccumulation section 303 according to the request from the receivingapparatus 40. In step S303, the control section 301 controls thecommunication section 304 and distributes the component (stream) whichis read out from the accumulation section 303 to the receiving apparatus40 via the network 90. When the process of step S303 ends, thedistribution process in FIG. 23 ends.

Above, description was given of a distribution process. In thedistribution process, according to a request from the receivingapparatus 40, various types of components (streams) which are providedfrom the data providing server 10 are distributed via the network 90.

(Receiving Process)

Finally, description will be given of a receiving process which isexecuted by the receiving apparatus 40 in FIG. 18 with reference to theflowchart in FIG. 24. The receiving process is executed, for example, ina case such as where the receiving apparatus 40 is started and a channelselection instruction is carried out due to an operation of a remotecontroller by the user.

In step S401, the tuner 402 receives a broadcast signal via the antenna401 and extracts and demodulates a broadcast signal of a service forwhich channel selection is instructed from the broadcast signal. Inaddition, the Demux 403 separates a BBP stream from the tuner 402 into acomponent and a control signal.

In step S402, the control section 407 selects an optimum component froma plurality of component candidates based on a control signal (signalinginformation in FIG. 10) from the Demux 403. In detail, since signalinginformation in FIG. 10 is acquired as a control signal in the controlsection 407, in the choices, firstly, an operation is controlledaccording to a selection determination on the basis of the number ofcomponents to be selected in a topAttribute attribute.

For example, in a case in which a selection determination is a userselection, the control section 407 displays information, which isdesignated as an attribute of a fixed selection target in each selectivecomponent group of a selective layer of the uppermost hierarchy, on aGUI screen and the user selects the selective component group(component). In addition, for example, in a case in which the selectiondetermination is automatic selection by the receiving device, thecontrol section 407 selects a selective component group (component) foreach selective component group of a selective layer of the uppermosthierarchy based on information which is designated as an attribute ofthe fixed selection target.

The component selection process is basically executed for each categoryof a component such as a video or an audio; however, a selectivecomponent group (component) is selected by moving to a new category in acase in which a view tag is designated as an attribute of a fixedselection target.

Next, in a case in which a plurality of composite component groupelements are present in the selected selective component group(component), the control section 407 selects a plurality of componentson which a designated component synthesis is to be performed fromcomponents of an adaptive selection target of an adaptive layer of alower hierarchy in a composite layer. Then, the control section 407controls the selection/synthesis sections 404 to 406, uses a pluralityof components which are adaptively selected, and performs a synthesisprocess.

Here, for example, in a case in which scalable is designated as anattribute of a synthesis target in a composite component group, a basestream which is transferred in a broadcast and an extended stream whichis transferred by communication are synthesized. In addition, forexample, in a case in which 3D is designated as an attribute in acomposite component group, a video for a right eye which is transferredin a broadcast and a video for a left eye which is transferred bycommunication are synthesized.

Here, description was given of a case in which a plurality of compositecomponent group elements are present; however, in a case in which thereis only one composite component group element, in a composite layer,optimum components are adaptively and successively selected fromcomponents of an adaptive selection target of an adaptive layer of alower hierarchy. In addition, in a case in which there is only onecomponent of an adaptive selection target in an adaptive layer, thiscomponent is selected every time. Furthermore, here, a component of theSPD and a representation element of the MPD are mapped by arepresentation ID, a component group of the SPD and an adaptation setelement of the MPD are mapped by an adaptation set ID, a componentcategory of the SPD and a group attribute of an adaptation set elementof the MPD are mapped by a group ID, and the SPD and the MPD aremutually coordinated. Due to this, it is possible to process components(objects) which are included in each layer in a cross-sectional mannerby sharing parameters of each layer of the SPD and the MPD.

When an optimum component is selected by the process in step S402, theprocess proceeds to step S403. In step S403, the video output section413 displays a video, which corresponds to the component of video orsubtitles selected by the process in step S402, on a display. Inaddition, in step S403, the audio output section 415 outputs a sound,which corresponds to an audio component selected by the process in stepS402, from a speaker. When the process in step S403 ends, the receivingprocess in FIG. 24 ends.

Above, description was given of a receiving process. In the receivingprocess, based on a control signal (signaling information in FIG. 10),an optimum component is selected from a plurality of receivablecomponent candidates in a broadcast or by communication and presented.Due to this, for example, in a case of selecting a desired televisionprogram, the user is able to watch and listen to video or sound whichcorresponds to the optimum component which is selected from theplurality of receivable component candidates.

Here, in the above description, “D”, which is an abbreviation ofDescription, is used as the title of the signaling data; however, thereare cases where “T”, which is an abbreviation of Table, is used. Forexample, there is a case in which the service configuration description(SCD) is described as a service configuration table (SCT). In addition,for example, there is a case in which the service parameter description(SPD) is described as a service parameter table (SPT). However, thedifference between the titles is a format difference between“Description” and “Table” and the substantial content of each signalingdata is not different.

<6. Configuration of Computer>

The series of processes described above is able to be executed byhardware and is also able to be executed by software. In a case ofexecuting the series of processes by software, a program whichconfigures the software is installed on a computer. FIG. 25 is a diagramwhich shows a configuration example of hardware of a computer whichexecutes the series of processes described above using a program.

In a computer 900, a central processing unit (CPU) 901, a read onlymemory (ROM) 902, and a random access memory (RAM) 903 are connected toeach other by a bus 904. An input and output interface 905 is furtherconnected with the bus 904. An input section 906, an output section 907,a recording section 908, a communication section 909, and a drive 910are connected with the input and output interface 905.

The input section 906 is formed of a keyboard, a mouse, a microphone,and the like. The output section 907 is formed of a display, a speaker,or the like. The recording section 908 is formed of a hard disk, anon-volatile memory, or the like. The communication section 909 isformed of a network interface and the like. The drive 910 drives aremovable media 911 such as a magnetic disk, an optical disc, amagneto-optical disc, or a semiconductor memory.

In the computer 900 which is configured as described above, the seriesof processes described above is performed by the CPU 901 loading aprogram which is stored in the ROM 902 or the recording section 908 intothe RAM 903 via the input and output interface 905 and the bus 904 andexecuting the program.

For example, it is possible to record the program executed by thecomputer 900 (the CPU 901) on the removable media 911 as a package mediaor the like to provide the program. In addition, it is possible toprovide a program via a wired or wireless transfer medium such as alocal area network, the internet, or digital satellite broadcasting.

In the computer 900, it is possible to install a program on therecording section 908 via the input and output interface 905 by mountingthe removable media 911 on the drive 910. In addition, it is possible toreceive a program in the communication section 909 via a wired orwireless transfer medium and install the program in the recordingsection 908. In addition thereto, it is possible to install a program onthe ROM 902 or the recording section 908 in advance.

Here, in the present specification, the processes which the computerperforms according to the program are not necessarily performed in timeseries in the described order as in the flowchart. That is, theprocesses which the computer performs according to the program includeprocesses which are executed simultaneously or individually (forexample, a simultaneous process or a process according to an object). Inaddition, the program may be processed by one computer (processor), ormay be dispersed and processed by a plurality of computers.

Here, embodiments of the present technique are not limited to theembodiments described above and various types of changes are possiblewithin a range which does not depart from the gist of the presenttechnique.

In addition, the present technique is able to have the followingconfigurations.

(1) A receiving apparatus including circuitry configured to receive adigital broadcast signal which uses an internet protocol (IP) transfermethod. Based on control information included in the digital broadcastsignal, acquire at least one of a broadcast component or a communicationcomponent, and control an operation of each section which performs apredetermined process relating to the acquired at least one component.The broadcast component and the communication component are associatedwith a component hierarchy including a first hierarchy level, a secondhierarchy level, and a third hierarchy level for each componentcategory, the third hierarchy level is for adaptively selecting thebroadcast component or the communication component, the second hierarchylevel is for synthesizing the component which is adaptively selected inthe third hierarchy level and a component which is not a target in thethird hierarchy into one synthesized component, and the first hierarchylevel is for selecting one of the synthesized component which issynthesized in the second hierarchy level, the component which isadaptively selected in the third hierarchy level, and a component whichis not a target in the second hierarchy level and the third hierarchylevel.(2) The receiving apparatus according to (1), in which the controlinformation includes information for managing the communicationcomponent in addition to the broadcast component.(3) The receiving apparatus according to (1) or (2), in which thecontrol information includes a media presentation description (MPD)which conforms to the moving picture expert group-dynamic adaptivestreaming over HTTP (MPEG-DASH) standard, a first identifier forassociating a component and a representation element of the MPD, asecond identifier for associating a group of components and anadaptation set element of the MPD, and a third identifier forassociating a component category and a group attribute of an adaptationset element of the MPD.(4) The receiving apparatus according to any one of (1) to (3), in whichthe control information includes first management information formanaging the broadcast component which is included in the digitalbroadcast signal and second management information for managing thecommunication component which is transferred via a communicationnetwork.(5) The receiving apparatus according to any one of (1) to (4), in whichthe control information describes a parameter relating to at least oneof a specific service or a component of the service, and a descriptorwhich describes information relating to an associated componenthierarchy of the component is arranged as the parameter relating to thespecific service.(6) The receiving apparatus according to any one of (1) to (4), in whichthe control information describes parameters relating to a specificservice, a component of the service, a group of components, and thecomponent category, and information relating to a component hierarchyassociated with the component is described.(7) The receiving apparatus according to any one of (1) to (4), in whichthe control information includes information relating to a synthesis ofa plurality of components in the second hierarchy level.(8) The receiving apparatus according to (3), in which the controlinformation includes a fourth identifier for a combination betweendifferent component categories with respect to the group of components.(9) The receiving apparatus according to any one of (1) to (8), in whichthe control information is transferred in a layer above an IP layeraccording to a protocol hierarchy of the IP transfer method, and acommon IP address is assigned to broadcast components of each service inthe control information.(10) A receiving method of a receiving apparatus, including receiving adigital broadcast signal which uses an IP transfer method. Based oncontrol information included in the digital broadcast signal, acquiring,by circuitry of the receiving apparatus, at least one of a broadcastcomponent or a communication component, and controlling, by thecircuitry, an operation of each section of the receiving apparatus whichperforms a predetermined process relating to the acquired at least onecomponent. The broadcast component and the communication component areassociated with a component hierarchy including a first hierarchy level,a second hierarchy level, and a third hierarchy level for each componentcategory, the third hierarchy level is for adaptively selecting thebroadcast component or the communication component, the second hierarchylevel is for synthesizing the component which is adaptively selected inthe third hierarchy level and a component which is not a target in thethird hierarchy into one synthesized component, and the first hierarchylevel is for selecting one of the synthesized component which issynthesized in the second hierarchy level, the component which isadaptively selected in the third hierarchy level, and a component whichis not a target in the second hierarchy level and the third hierarchylevel.(11) A transmitting apparatus including circuitry configured to acquirecontrol information; acquire a broadcast component of a service; andtransmit the control information in addition to the broadcast componentin a digital broadcast signal which uses an IP transfer method. Thebroadcast component and a communication component are associated with acomponent hierarchy structure including a first hierarchy level, asecond hierarchy level, and a third hierarchy level for each componentcategory, the third hierarchy level is for adaptively selecting thebroadcast component or the communication component, the second hierarchylevel is for synthesizing the component which is adaptively selected inthe third hierarchy level and a component which is not a target in thethird hierarchy level into one component, and the first hierarchy levelis for selecting one of the component which is synthesized in the secondhierarchy level, the component which is adaptively selected in the thirdhierarchy level, and a component which is not a target in the secondhierarchy level and the third hierarchy level.(12) The transmitting apparatus according to (11), in which the controlinformation includes information for managing the communicationcomponent in addition to the broadcast component.(13) The transmitting apparatus according to (11) or (12), in which thecontrol information includes a media presentation description (MPD)which conforms to the moving picture expert group-dynamic adaptivestreaming over HTTP (MPEG-DASH) standard, a first identifier forassociating a component and a representation element of the MPD, asecond identifier for associating a group of components and anadaptation set element of the MPD, and a third identifier forassociating a component category and a group attribute of an adaptationset element of the MPD.(14) The transmitting apparatus according to any one of (11) to (13), inwhich the control information includes first management information formanaging at least the broadcast component which is included in thedigital broadcast signal and second management information for managingonly the communication component which is transferred via acommunication network.(15) The transmitting apparatus according to any one of (11) to (14), inwhich the control information describes a parameter relating to at leastone of a specific service or a component of the service, and adescriptor which describes information relating to a component hierarchyassociated with the component is arranged as the parameter relating tothe specific service.(16) The transmitting apparatus according to any one of (11) to (14), inwhich the control information describes parameters relating to aspecific service, a component of the service, a group of components, andthe component category, and information relating to a componenthierarchy associated with the component is described.(17) The transmitting apparatus according to any one of (11) to (14), inwhich the control information includes information relating to asynthesis of a plurality of components in the second hierarchy level.(18) The transmitting apparatus according to (13), in which the controlinformation includes a fourth identifier for a combination betweendifferent component categories with respect to the group of components.(19) The transmitting apparatus according to any one of (11) to (18), inwhich the control information is transferred in a layer above an IPlayer according to a protocol hierarchy of the IP transfer method, and acommon IP address is assigned to the broadcast component of each servicein the control information.(20) A transmitting method including acquiring control information;acquiring, by circuitry of the transmitting apparatus, a broadcastcomponent of a service, and transmitting, by the circuitry, the controlinformation in addition to the broadcast component in a digitalbroadcast signal which uses an IP transfer method. The broadcastcomponent and a communication component are associated with a componenthierarchy structure including a first hierarchy level, a secondhierarchy level, and a third hierarchy level for each componentcategory, the third hierarchy level is for adaptively selecting thebroadcast component or the communication component, the second hierarchylevel is for synthesizing the component which is adaptively selected inthe third hierarchy level and a component which is not a target in thethird hierarchy level into one component, and the first hierarchy levelis for selecting one of the component which is synthesized in the secondhierarchy level, the component which is adaptively selected in the thirdhierarchy level, and a component which is not a target in the secondhierarchy level and the third hierarchy level.

It should be understood by those skilled in the art that variousmodifications, combinations, sub-combinations and alterations may occurdepending on design requirements and other factors insofar as they arewithin the scope of the appended claims or the equivalents thereof.

REFERENCE SIGNS LIST

-   1 Broadcast communication system-   10 Data providing server-   20 Transmitting apparatus-   30 Distribution server-   40 Receiving apparatus-   90 Network-   201 Component acquiring section-   202 Control signal acquiring section-   203 Mux-   204 Transmitting section-   301 Control section-   302 Component acquiring section-   303 Accumulation section-   304 Communication section-   402 Tuner-   403 Demux-   404, 405, 406 Selection/synthesis section-   407 Control section-   408 NVRAM-   409 Input section-   410 Communication section-   411 Demux-   412 Video decoder-   413 Video output section-   414 Audio decoder-   415 Audio output section-   416 Subtitles decoder-   421 Receiving buffer-   900 Computer-   901 CPU

1. A receiving apparatus comprising: circuitry configured to: receive adigital broadcast signal which uses an internet protocol (IP) transfermethod; and based on control information included in the digitalbroadcast signal, acquire a plurality of components via broadcast orcommunication, wherein the plurality of components is associated with acomponent hierarchy including a first hierarchy level, a secondhierarchy level, and a third hierarchy level for a video, audio, orclosed caption component category, the third hierarchy level is foradaptively selecting a component according to the control information,the second hierarchy level is for synthesizing, according to the controlinformation, the component which is adaptively selected in the thirdhierarchy level and a component which is not a target in the thirdhierarchy level a synthesized component, and the first hierarchy levelis for selecting, according to the control information, one of thesynthesized component, the component which is adaptively selected in thethird hierarchy level, and a component which is not a target in thesecond hierarchy level and the third hierarchy level.
 2. (canceled) 3.The receiving apparatus according to claim 1, wherein the controlinformation includes: a media presentation description (MPD) whichconforms to a moving picture expert group-dynamic adaptive streamingover HTTP (MPEG-DASH) standard, a first identifier for associating acomponent and a representation element of the MPD, a second identifierfor associating a group of components and an adaptation set element ofthe MPD, and a third identifier for associating a component category anda group attribute of an adaptation set element of the MPD.
 4. Thereceiving apparatus according to claim 1, wherein the controlinformation includes first management information for managing abroadcast component which is included in the digital broadcast signaland second management information for managing a communication componentwhich is transferred via a communication network.
 5. The receivingapparatus according to claim 1, wherein the control information includesa parameter relating to at least one of a specific service or acomponent of the service, and a descriptor which describes informationrelating to an associated component hierarchy of the component isarranged as the parameter relating to the specific service. 6-7.(canceled)
 8. The receiving apparatus according to claim 3, wherein thecontrol information includes a fourth identifier for a combinationbetween different component categories with respect to the group ofcomponents.
 9. The receiving apparatus according to claim 1, wherein thecontrol information is transferred in a layer above an IP layeraccording to a protocol hierarchy of the IP transfer method, and acommon IP address is assigned to broadcast components of each service inthe control information.
 10. A receiving method of a receivingapparatus, the method comprising: receiving a digital broadcast signalwhich uses an IP transfer method; and based on control informationincluded in the digital broadcast signal, acquiring, by circuitry of thereceiving apparatus, a plurality of components via broadcast orcommunication, wherein the plurality of components is associated with acomponent hierarchy including a first hierarchy level, a secondhierarchy level, and a third hierarchy level for a video, audio, orclosed caption component category, the third hierarchy level is foradaptively selecting a component according to the control information,the second hierarchy level is for synthesizing, according to the controlinformation, the component, level and a component which is not a targetin the third hierarchy level into a synthesized component, and the firsthierarchy level is for selecting, according to the control information,one of the synthesized component, the component which is adaptivelyselected in the third hierarchy level, and a component which is not atarget in the second hierarchy level and the third hierarchy level. 11.A transmitting apparatus comprising: circuitry configured to acquirecontrol information; acquire a broadcast component of a service; andtransmit the control information in addition to the broadcast componentin a digital broadcast signal which uses an IP transfer method, whereinthe broadcast component and a communication component are associatedwith a component hierarchy structure including a first hierarchy level,a second hierarchy level, and a third hierarchy level for a video,audio, or closed caption component category, the third hierarchy levelis for adaptively selecting the broadcast component or the communicationcomponent according to the control information, the second hierarchylevel is for synthesizing, according to the control information, thecomponent which is adaptively selected in the third hierarchy level anda component which is not a target in the third hierarchy level into asynthesized component, and the first hierarchy level is for selecting,according to the control information, one of the synthesized component,the component which is adaptively selected in the third hierarchy level,and a component which is not a target in the second hierarchy level andthe third hierarchy level.
 12. (canceled)
 13. The transmitting apparatusaccording to claim 11, wherein the control information includes: a mediapresentation description (MPD) which conforms to a moving picture expertgroup-dynamic adaptive streaming over HTTP (MPEG-DASH) standard, a firstidentifier for associating a component and a representation element ofthe MPD, a second identifier for associating a group of components andan adaptation set element of the MPD, and a third identifier forassociating a component category and a group attribute of an adaptationset element of the MPD.
 14. The transmitting apparatus according toclaim 11, wherein the control information includes first managementinformation for managing at least the broadcast component which isincluded in the digital broadcast signal and second managementinformation for managing the communication component which istransferred via a communication network.
 15. The transmitting apparatusaccording to claim 11, wherein the control information describes aparameter relating to at least one of a specific service or a componentof the service, and a descriptor which describes information relating toa component hierarchy associated with the component is arranged as theparameter relating to the specific service. 16-17. (canceled)
 18. Thetransmitting apparatus according to claim 13, wherein the controlinformation includes a fourth identifier for a combination betweendifferent component categories with respect to the group of components.19. The transmitting apparatus according to claim 11, wherein thecontrol information is transferred in a layer above an IP layeraccording to a protocol hierarchy of the IP transfer method, and acommon IP address is assigned to the broadcast component of each servicein the control information.
 20. A transmitting method for a transmittingapparatus, the method comprising: acquiring control information;acquiring, by circuitry of the transmitting apparatus, a broadcastcomponent of a service, and transmitting, by the circuitry, the controlinformation in addition to the broadcast component in a digitalbroadcast signal which uses an IP transfer method, wherein the broadcastcomponent and a communication component are associated with a componenthierarchy structure including a first hierarchy level, a secondhierarchy level, and a third hierarchy level for a video, audio, orclosed caption component category, the third hierarchy level is foradaptively selecting the broadcast component or the communicationcomponent according to the control information, the second hierarchylevel is for synthesizing, according to the control information, thecomponent which is adaptively selected in the third hierarchy level anda component which is not a target in the third hierarchy level into asynthesized component, and the first hierarchy level is for selecting,according to the control information, one of the synthesized component,the component which is adaptively selected in the third hierarchy level,and a component which is not a target in the second hierarchy level andthe third hierarchy level.
 21. The receiving method according to claim10, wherein the control information includes: a media presentationdescription (MPD) which conforms to a moving picture expertgroup-dynamic adaptive streaming over HTTP (MPEG-DASH) standard, a firstidentifier for associating a component and a representation element ofthe MPD, a second identifier for associating a group of components andan adaptation set element of the MPD, and a third identifier forassociating a component category and a group attribute of an adaptationset element of the MPD.
 22. The receiving method according to claim 10,wherein the control information includes first management informationfor managing a broadcast component which is included in the digitalbroadcast signal and second management information for managing acommunication component which is transferred via a communicationnetwork.
 23. The receiving method according to claim 10, wherein: thecontrol information includes a parameter relating to at least one of aspecific service or a component of the service, and a descriptor whichdescribes information relating to an associated component hierarchy ofthe component is arranged as the parameter relating to the specificservice.
 24. The receiving method according to claim 21, wherein thecontrol information includes a fourth identifier for a combinationbetween different component categories with respect to the group ofcomponents.
 25. The receiving method according to claim 10, wherein thecontrol information is transferred in a layer above an IP layeraccording to a protocol hierarchy of the IP transfer method, and acommon IP address is assigned to broadcast components of each service inthe control information.